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® Recombinant polypeptides and their uses, mduslng assay for aids virus. 

© Novel recombinant HTLV-III envelope proteins denoted 
RIO, PB1. 590, and KH1, are useful in the diagnosis, pro- 
phylaxis or therapy of AIDS. Protein R10 is a 95 kD fusion 
protein; protein PB1 is a 26 kD fusion protein; protein 690 Is 
en 86 kD fusion protein; and protein KH1 is a 70 kD fusion 
protein. These proteins are considered to be especially useful 
to prepare vaccines for the HTLV-III virus. 
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RECOMBINANT POLYPEPTIDES AND THEIR USES, 
INCLUDING ASSAY FOR AIDS VIRUS 
This invention relates to recombinant polypeptides 
and their uses, including assay for AIDS virus. The 
5 virus which causes AIDS (acquired immune deficiency 
syndrome) has been identified as human T-cell 
lymphotropic virus type III (HTLV-III) , lymphadenopathy- 
associated virus (LAV) or AIDS-associated retrovirus 
(ARV) ; see Popovic et al, Science 2Z4 (1984) 497-500* 
10 The virus has been designated HIV (human immunodeficiency 
virus) by the International Committee on Taxonomy of 
Virus. 

The virus displays tropism for the OKT4 + lymphocyte 
subset; see Klatzmann et al, Science 225 (1984) 59-63. 

15 Antibodies against HTLV-III proteins in the sera of 

most AIDS and AIDS-related complex (ARC) patients, and in 
asymptomatic people infected with the virus (Sarngadharan 
et al, Science 224 (1984) 506-508) have made possible the 
development of immunologically based tests that detect 

20 antibodies to these antigens. These tests are used to 

limit the spread , of HTLV-III through blood transfusion by 
identifying blood samples of people infected with the 
virus. Diagnostic tests currently available commercially 
use the proteins of inactivated virus as antigens. 

25 in addition to allowing new approaches for 

diagnosis, recombinant DNA holds great promise for the 
development of vaccines against both bacteria and 
viruses? see Wilson, Bio/Technology 2 (1984) 29-39. 
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The most widely employed organisms to express recom- 
binant vaccines have been E. coli , S. cerevisiae 
and cultured mammalian cells. For example, subunit 
vaccines against foot and mouth disease (Kleidj D.G. f 
Yansura, D. , Smail, B. , Dowbenko , D. , Moore, D.M. , 
Brubman, M.J., McKercher , F . D . , Morgan, D.O. , Robertson, 
B.H. and Bachrach, H.L. [1981] Science Ilk : 1125-1129) 
and malaria (Young, J.F., Hockmeyer, W.T., Gross, M. , 
Ripley Ballou, W. , Wirtz, R.A. , Trosper, J.H., Beaudoin, 
R.L., Hollingdale, M.R., Miller, L.M. , Diggs , C.L. 
and Rosenberg, M. [1985] Science 228:958-962) have 
been synthesized in E. coli . Other examples are 
hepatitis B surface antigen produced in yeast (McAleer , 
W.J., Buynak, E.B. , Maigetter, R. Z. , -Wamoler , D.E. , 
Miller, W.J. and Hilleman, M.R. [1984] Nature 307: 
178-180) and a herpes vaccine produced in mammalian 
cells (Berman, P.W. , Gregory, T. , Chase, D. and La sky , 
L.A. [1984] Science 227:1490-1492). 

There is a real need at this time to develop a 
vaccine for AIDS. No such vaccine is known to exist. 

Brief Summary of the Invention 

The subject invention concerns . novel recombinant 
HTLV-III proteins and the uses thereof. More specifi- 
cally, the subject invention concerns four novel 
recombinant HTLV-III envelope proteins v?hich can be 
used in the diagnosis, prophylaxis or therapy of AIDS. 
Further, the recombinant HTLV-III envelope protein 
fragments of the invention can be used to stimulate 
a lymphocyte proliferative response in HTLV-III in- 
fected humans. This then would stimulate the immune 
system to respond to HTLV-III in such individuals and," 
therefore, the. envelope protein fragments can provide 
protection and be of therapeutic value. 
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These novel proteins are encoded on bacterial plasmids 
which can be used to transform suitable hosts, for 
example, E. coli, using standard procedures. 
In the accompanying drawings: 

Figures 1A and IB are sequential flow charts of the. 
construction, from plasmid pBGl, of plasmid pREV2.2 which 
is used to construct vectors encoding novel proteins; 

Figure 2 is a diagram of plasmid pREV2.2 and also of 
the multiple cloning site; and 

Figure 3 is a schematic representation of the 
HTLV-III envelope gene and also of novel recombinant 
proteins obtained therefrom. 

Expression vector plasmid pREV2.2 was constructed 
from plasmid pBGl by the route shown in Figure 1 of the 
drawings. In the product, the hatched region represents 
TrpA trasc. terminator and the dotted region represents 
multiple cloning sites for enzymes Nrul, Mlul, Eco RV , 
Clal, BamHI, Sai l/ Hindlll, Sma l, StsI, Eco RI . 

Plasmid pRIO contains approximately 1275 base pairs 
of DNA encoding the HTLV-III env gene from essentially 
the Kpn l site to the Bgl ll . This plasmid in a suitable 
bacterial host, e.g. E. coli, can be used to produce the 
novel recombinant HTLV-III 95 kD fusion protein denoted 
RIO. The amino-acid sequence of fusion protein RIO is 
shown in Table 8; the DNA sequence encoding this protein 
is shown in Table 8A. 

Plasmid pPBl contains approximately 540 base pairs 
of DNA encoding essentially the HTLV-III env gene from 
the PvuII site to the Bgl ll site. This plasmid in a 
suitable host, e.g. E. coli , can be used to produce the : 
novel recombinant HTLV-III 26 kD fusion protein denoted 
PB1. The amino-acid sequence of fusion protein PB1 is 
shown in Table 9; the DNA sequence encoding this protein 
is shown in Table 9A. 
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Plasmid p590 contains approximately 1055 base pairs 
of DNA encoding essentially the HTLV-III env gene 
from the PvuII site to the Hindlll site. This plasmid 
in a suitable host, e.g., E. coli , can be used to pro- 
duce the novel recombinant HTLV-III 86 kD protein 
denoted 590. The amino acid sequence of fusion 
protein 590 is shown in Table 10; the DNA sequence 
encoding this protein is shown in Table 10A. 

Plasmid pKHl contains approximately 1830 base 
pairs of DNA encoding essentially the HTLV-III env 
gene from the Kpn l site to the Hin dlll site. This 
plasmid in a suitable host, e.g., E. coli , can be used 
to produce the novel recombinant HTLV-III 70 kD protein 
denoted KH1 . The amino acid sequence of fusion protein 
KH1 is shown in Table 11; the DNA sequence encoding 
this protein is shown in Table 11A. 

Plasmid pBGl is deposited in the E. coli host 
MS371 with the Northern Regional Research Laboratory 
(NRRL) , U.S: Department of Agriculture, Peoria, Illi- 
nois, USA. It is in the permanent collection of this 
repository. E. coli MS371(pBGl), NRRL B-15904, was 
deposited on Nov. 1, 1984. E. coli MS371, NRRL B- 
15129, is now available to the public, E. coli SG20251, 
NRRL B-15918, was deposited on Dec. 12, 1984. 



Other relevant NRRL deposits, their 
deposit dates and numbers, are as follows: 

Cul lure Ac cession No. Date of Deposit 

E. coli JM103(pREV2.2) NRRL B-18091 July 30, 1986 

E. coli SG20251(pR10) NRRL B-18093 July 30 , 1986 

E. coli SG20251(pPBl) NRRL B-18092 July 30, 1986 

E. coli SG20251(p590) NRRL B-18094 July 30, 1986 

E. coli CAG629(pKHl) NRRL B-18095 July 30, 1986 
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The novel HTLV-I1I proteins of the subject 
invention can be expressed in Saccharomyces cerevisiae 
using plasrnids containing the inducible galactose 
promoter from this organism (Broach, J.R. , Li , Y . , 
Wu t L.C. and Jayaram, M. in Experimental Manipulation 
of Gene Expression [1983] p. 83, ed. M. Inouye , 
Academic Press). These plasrnids are called YEp51 and 
YEp52 (Broach, J.R. et al. [1983]) and- contain the 
E. coli origin of replication, the gene for . B- lactamase . 
the yeast LEU2 gene, the 2 urn origin of replication and 
the 2 ym circle REP 3 locus. Recombinant gene expression 
is driven by the yeast GAL 10 gene promoter. 

Yeast promoters such as galactose and alcohol 
dehydrogenase (Bennetzen, J.L. and Hall, B.D. [1982] 
J. Biol. Chem. 257:3018; Ammerer , G. in Methods in 
Enzymology [1983] Vol. 101, n. 192), ohosphoglycerate 
kinase (Derynck. R. , Hitzeroan, R.A. . Gray, P.W. , 
Goeddel, D.V. , in Experimental Manipulation of Gene 
Expression [1983] p. 247, ed. M. Inouye, Academic 
Press) , triose phosphate isomerase (Alber , T. and 
Kawasaki, G. [1982] J. Molec. and Applied Genet. 
1:419), or enolase (Innes, M.A. et al. [1985] Science 
226:21) can be used. 



-6- 



02S5T9O 



The genes disclosed herein can be expressed 
in simian cells. When the genes encoding these proteins 
are cloned into one of the plasmids as described in 
Okayama and Berg (Okayama, H. and Berg, P. [1983] 
Molec. and Cell. Biol. 3:280) and references therein, 
or COS cells transformed with these plasmids, synthesis 
of HTLV-1II proteins can be detected immunologically. 

Other mammalian cell gene expression/protein 
production systems can be used. Examples of other such 
systems are the vaccinia virus expression system 

(Moss, B. {1985] Immunology Today 6:243; Chakrabarti, S. 

Brechling, K. , Moss, B. [1985] Molec. and Cell. Biol. 

5:3403) and the vectors derived from murine retroviruses 

(Mulligan. R.C. in Experimental Manipulation of Gene 

Expression [1983] p. 155. ed. M. Inouye , Academic Press) 
The HTLV-II1 proteins of the subject invention, 

car. be chemically synthesized by solid ohase peptide 

synthetic techinques such as BOC and FMOC (Merrifield. 

R.B. [1963] J. Amer. Chem. Soc . 85:2149; Chang, C. 

and Meienhofer, J. [1978] Int. J. Peptide Protein Res. 

.11:246). 

As is well known in the art, the amino acid 
sequence of a protein is determined by the nucleotide 
sequence of the DNA. Because of the redundancy of the 
genetic code, i.e., more than one coding nucleotide 
triplet (codon) can be used for most of the amino 
acids used to make proteins , different nucleotide 
sequences can code for a particular amino acid. Thus, 
the genetic code can be depicted as follows: 
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Phenylalanine (Phe) 


TTK 


Histidine (H-ie^ 


CAY 


Leucine (Leu) 


XTY 


Glutamine (Gin} 


CA 1 


Isoleucine (lie) 


ATM 


Asoaracine /Aen^ 


AAV 


Methionine (Met) 


ATG 


Lvsine ^T\7Q^ 


A A 7 
AAJ 


Valine (Val) 


GTL 


Asparcic acia (Asu) 


GAK 


Serine (Ser) 


QRS 


Glutamic acid (Glu) ' 


CAJ 


Proline (Pro) 


CCL 


Cysteine (Cys): 


TGK 


Threonine (Thr) 


ACL 


Tryptophan (Trp) 


TGG 


Alanine (Ala) 


GCL 


Arginine (Arg) 


WGZ 


Tyrosine (Tyr) 


TAK 


Glycine (Gly) 


GGL 


Termination Signal 


TAJ 






Termination Signal 


TGA 







Key: Each 3-letter deoxynucleotide triplet corresponds 
to a trinucleotide of mRNA , having a 5* -end on the 
left arid a 3'-end on the right. All DNA sequences 
given herein are those of the strand whose sequence 
corresponds to the mRNA sequence, with thymine substi- 
tuted for uracil. The letters stand for the purine or 
pyrimidine bases forming the deoxynucleotide: sequence . 



A 




adenine 


G 




guanine 


C 




cytosine 


T 




thymine 


X 




T or C if Y is A or G 


X 




C if Y is C or T 


Y 




A, G, C or T if X is C 


Y 




A or G if X is T 


W 




C or A if Z is A or G 


w 




C if Z is C or T 


z 




A, G, C or T if W is C 


z 




A or G if W is A 



QR = TC if S is A, G, C or T; alternatively QR = 
AG if S is T or C 
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J = A or G. 

K = T or C 

L = A, T, C or G 

K - A, C or T 

5 

The above shows that the novel amino acid sequences 
of the HTLV-III proteins of the subject invention can 
be prepared by nucleotide sequences other than those 
disclosed herein. Functionally equivalent nucleotide 

10 sequences encoding the novel amino acid sequences of 

these HTLV-III proteins, or fragments thereof having 
HTLV-III antigenic or immunogenic or therapeutic 
activity, can be prepared by known synthetic procedures. 
Accordingly, the subject invention includes such 

15 functionally equivalent nucleotide sequences. 

Thus the scope of the subject invention includes 
not only the specific nucleotide sequences depicted 
herein, but also all equivalent nucleotide sequences 
coding for molecules with substantially the same 

20 HTLV-III antigenic or immunogenic or therapeutic 

activity. 

Further, the scope of the subject invention is 
intended to cover not only the specific amino acid 
sequences disclosed, but also similar sequences coding 

25 for proteins or protein fragments having comparable 

ability to induce the formation of and/or bind to 
specific HTLV-in antibodies. 

The term "equivalent' 1 is being used in its ordinary 
patent usage here as denoting a nucleotide sequence which 

30 performs substantially as the nucleotide sequence iden- 

tified herein to produce molecules with substantially the 
same HTLV-III antigenic or immunogenic or therapeutic 
activity in essentially the same kind of hosts. Within 
this definition are subfragments which have HTLV-III 

35 antigenic or immunogenic or therapeutic activity. 
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As disclosed above, it is well within the skill 
of those in the genetic engineering art to use the 
nucleotide sequences encoding HTLV-III antigenic or 
immunogenic or therapeutic activity of the subject 
invention to produce HTLV-III proteins via microbial 
processes. Fusing the sequences into an expression 
vector and transforming or transfecting into hosts , 
either eukaryotic (yeast or mammalian cells) or 
prokaryotic (bacterial cells), are standard procedures 
used in producing other well-known proteins, e.g., 
insulin, interferons, human growth hormone , IL- 1 , IL-2, 
and the like. Similar procedures, or obvious modifica- 
tions thereof, can be employed to prepare HTLV-III 
proteins by microbial means or tissue-culture technology 
15 in accord with the subject invention. 

The nucleotide sequences disclosed herein can be 
prepared by a "gene machine" by procedures well known 
in the art. This is possible because of the disclosure 
of the nucleotide sequence. 
20 The restriction enzymes disclosed can be purchased 

from Bethesda Research Laboratories, Gaithersburg , MD , 
or New England Biolabs , Beverly, MA. The enzymes are 
used according to the instructions provided by the 
supplier. 

25 The various methods employed in the preparation 

of the plasmids and transformation of host organisms 
are well known in the art. These procedures are all 
described in Maniatis, T. , Fritsch, E.F., and Sambrook . 
J- (1982) Molecular Cloning ; A Laboratory Manual . 
Cold Spring Harbor Laboratory, New York.- Thus, it 
is within the skill of those in the genetic engineering 
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art to extract DNA from microbial cells , perform 
restriction enzyme digestions, electrophorese DNA 
fragments, tail and anneal plasmid and insert DNA, 
ligate DNA , transform cells, e.g., E. coli cells, 
prepare plasmid DNA, electrophorese proteins, and 

sequence DNA. 

Immunochemical assays employing the HTLV-III 
proteins of the invention can take a variety of forms. 
The preferred type is a solid phase immunometric assay, 
in assays of this type, anHTLV-III protein is immobilized 
on a solid phase to form an antigen- immunoadsorbent. 
The immunoadsorbent is incubated with the sample to be 
tested. After an appropriate incubation period, the 
immunoadsorbent is separated from the sample and 
labeled ant i- (human IgG) antibody is used to detect 
human anti-HTLV-III antibody bound to the immunoadsor- 
bent. The amount of label associated with the immuno- 
adsorbent can be comoared to positive and negative 
controls to assess the presence or absence of anti- 
HTLV-III antibody. 

The immunoadsorbent can be prepared by adsorbing 
or coupling a purified HTLV-III protein to a solid 
phase. Various solid phases can be used, such as 
beads formed of glass, oolystyrene, polypropylene, 
dextran or other material. Other suitable solid phases 
include tubes or plates formed from or coated with 

these materials. 

The HTLV-III proteins can be either covalently or 
non-covalently bound to the solid phase by techniques 
such as covalent bonding via an amide or ester linkage 
or adsorption. After the HTLV-III protein is affixed 
to the solid ohase, the solid phase can be post-coated 
with an animal protein, e.g. . 3% fish gelatin. This 
provides a blocking protein which reduces nonspecific 
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adsorption of protein to the iirnnunoadeorbent surface . 

The immuno adsorbent is then incubated with the * 
sample to be tested for anti-HTLV-III antibody. In ' 
blood screening, blood plasma or serum is used. The 
plasma or serum is diluted with normal animal plasma 
or serum. The diluent plasma or serum is derived 
from the same animal species that is the source of 
the anti- (human IgG) antibody. The preferred anti- 
(human IgG) antibody is goat anti-(human IgG) antibody . ' 
Thus, in the preferred format, the diluent would be 
goat serum or plasma. 

The conditions of incubation, e.g., pH. and tempera- 
ture, and the duration of incubation are not crucial. 
These parameters can be optimized by routine experimen- 
tation. Generally, the incubation will be run for 1-2 
hr at about 45 °C in a buffer of oH 7-8. 

After incubation, the immunoadsorbent and the 
sample are separated. SeDaration can be accomplished 
by any conventional separation technique such as 
sedimentation or centrifugation . The immunoadsorbent 
then may be washed free of samole to eliminate any 
interfering substances. 

The immunoadsorbent is incubated with the labeled 
anti- (human IgG) antibody (tracer) to detect human 
25 antibody bound thereto. Generally the immunoadsorbent 
is incubated with a solution of the labeled anti- (human. 
IgG) antibody which contains a small amount (about 1Z) 
of the serum or plasma of the animal species which 
serves as the source of the anti- (human IgG) antibody.-.. 
Anti- (human IgG) antibody can be obtained from any 
animal source. However, goat anti- (human IgG) antibody 
is preferred. The anti- (human IgG) antibody can be an 
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antibody against the Fc fragment of human IgG, for 
example, goat an ti- (human IgG) Fc antibody. 

The anti- (human IgG) antibody or ant i- (human 
lgG)Fc can be labeled with a radioactive material 
such as 125 iodine; labeled with an optical label, such 
as a fluorescent material; or labeled with an enzyme 
such as horseradish peroxidase. The anti-human antibody 
can also be biotinylated and labeled avidin used to 
detect its binding to the immunoadsorbent. 

After incubation with the labeled antibody, the 
immunoadsorbent is separated from the solution and 
the label associated with the immunoadsorbent is 
evaluated. Depending upon the choice of label, the 
evaluation can be done in a variety of ways. The label 
may be detected by a gamma counter if the label is a 
radioactive gamma emitter, or by a fluorimeter, if the 
label is a fluorescent material. In the case of an 
enzyme, label detection may be done colorimetrically 
employing a substrate for the enzyme. 

The amount of label associated with the immuno- 
adsorbent is compared with positive and negative 
controls in order to determine the presence of anti- 
HTLV-III antibody. The controls are generally run 
concomitantly with the sample to be tested. A positive 
control is a serum containing antibody against HTLV-III; 
a negative control is a serum from healthy individuals 
which does not contain antibody against HTLV-III. 

For convenience and standardization, reagents 
for the performance of the immunometric assay can be 
assembled in assay kits. A kit for screening blood, 
for example, can include: 

(a) an immunoadsorbent, e.g., a polystyrene bead 
coated with an HTLV-III protein; 
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(b) a diluent for the serum or plasma sample, 
e -g.i normal goat serum or plasma; 

(c) an anti-(human IgG) antibody, e.g., goat 
anti- (human IgG) antibody in buffered, aqueous 
solution containing about 1Z goat serum or 
plasma; 

(d) a positive control, e.g., serum containing 
antibody against at least one of the novel 
HTLV-III proteins; and 

(e) a negative control, e.g., pooled sera from 
healthy individuals which does not contain 
antibody against at least one of the novel 
HTLV-III proteins. ' 

If the label is an enzyme, an additional element 

15 of the kit can be the substrate for the enzyme. 

Another type of assay for anti-HTLV-Ill antibody 
is an antigen sandwich assay. In this assay, a labeled 
HTLV-III protein is used in Dlace of anti- (human IgG) 
antibody to detect anti-HTLV-III antibody bound to the 

20 immunoadsorbent. The assay is based in principle on the 

bivalency of antibody molecules. One binding site of 
the antibody binds the antigen affixed to the solid 
phase; the second is available for binding the labeled 
antigen. The assay procedure is essentially the same 

25 as described for the iiranunometric assay except that 

after incubation with the sample, the imraunoiidsorbent 
is incubated with a solution of labeled HTLV-III protein. 
HTLV-III proteins can be labeled with radioisotope, 
an enzyme, etc. for this type of assay. 

30 In a third format, the bacterial protein, protein A, 

which binds the Fc segment of an IgG molecule without 
interfering with the antigen-antibody interaction can 
be used as the labeled tracer to detect anti-HTLV- 
antibody adsorbed to the immunoadsorbent. Protein A 

35 



-14- 

can be readily labeled with a radioisotope, enzyme or 
other detectable soecies. 

Immunochemical assays employing an HTLV-III 
protein have several advantages over those employing 

5 a whole (or disrupted) virus. Assays based upon an 

HTLV-III protein will alleviate the concern over growing 
largs quantities of infectious virus and the inherent 
variability associated with cell culturing and virus 
production. Further, the assay will help mitigate the 

10 real or perceived fear of contracting AIDS by technicians 
in hospitals, clinics and blood banks who perform the 
test. 

Vaccines of the HTLV-III proteins, disclosed 
herein, and variants thereof having antigenic properties, 
15 can be prepared by procedures well known in the art. 

For example, such vaccines can be prepared as injectables, 
e.g., liquid solutions or suspensions. Solid forms, 
for solution in, or suspension in, a liquid prior to 
injection also can be prepared. Optionally, the 

20 preparation also can be emulsified. The active anti- 

genic ingredient or ingredients can be mixed with excipi- 
ents which are pharmaceutical^ acceptable and compatible 
with the active ingredient. Examples of suitable 
excipients are water, saline, dextrose, glycerol, 

25 ethanol, or the like, and combinations thereof. In 
addition, if desired, the vaccine can contain minor 
amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, or adjuvants 
which enhance the effectiveness of the vaccine. The 

30 vaccines are conventionally administered parenterally , 
by injection, for example, either subcutaneously or 
intramuscularly. Additional formulations which are 
suitable for other modes of administration include 
suppositories and, in some cases, oral formulations. 
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For suppositories, traditional binders and carriers 
include, for example, polyalkalene glycols, or trigly- * 
cerides. Suppositories can be formed from mixtures : 
containing the active ingredient in the range of about • 
5 0.5% to about 10%, preferably about 1 to about 2% . 

Oral formulations can include such normally employed 
excipients as, for example, pharmaceutical grades of 
manitol, lactose, starch, magnesium steaf ate , sodium 
saccharine, cellulose, magnesium carbonate and the 
10 like. These compositions can take the form of solu- 
tions, suspensions, tablets, pills, capsules, sustained 
release formulations or powders and contain from about 
10% to about 95% of active ingredient, preferably from 
about 25% to about 70%. 
15 The proteins, can be formulated into the vaccine 

as neutral or salt forms. Pharmaceutical^ acceptable 
salts include the acid addition salts (formed with the 
free amino groups of the peptide) and which are formed ■ 
with inorganic acids such as, for example, hydrochloric 
or phosphoric acids, or such organic acids as acetic, 
oxalic, tartaric, mandelic, and the like. Salts 
formed with the free carboxyl groups also can be 
derived from inorganic bases such as, for example , 
sodium, potassium, ammonium, calcium, or ferric hy- 
25 droxides , and such organic bases as isopropylamine , 

trimethylamine, 2-ethylamino ethanol, histidine , pro- 
caine , and the like. 

The vaccines are administered in a manner compati- ^ 
ble with the dosage formulation, and in such amount as 
will be therapeutically effective and immunogenic. The 
quantity to be administered depends on the subject to 
be treated, capacity of the subject's immune system to 
synthesize antibodies, and the degree of protection 
desired. Precise amounts of active ingredient required 

35 
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to be administered depend on the judgment of the 
practitioner and are peculiar to each individual. 
However, suitable dosage ranges are of the order of 
about several hundred micrograms active ingredient 
per individual. Suitable regimes for initial admini- 
stration and booster shots are also variable, but are 
typified by an initial administration followed in one 
or two week intervals by a subsequent injection or 
other administration. 

HTLV-III is known to undergo amino acid. sequence 
variation, particularly in the envelope gene (Starcich, 
B.R. [1986] Cell 45:637-648; Hahn, B.H. et al. [1986] 
Science 232:1548-1553). Over 100 variants have been 
analyzed by molecular cloning and restriction enzyme 
recognition analysis, and several of these have been 
analyzed by nucleotide sequencing. Some of these are 
the HTLV-III isolates known as RF (Popovic , M. et al. 
[1984] Science 224:497-500), WMJ-1 (Hahn, B.H. et al. 
[1986] Science 232:1548-1553), LAV (Wain-Hobson , S. 
et al. [1985] Cell 40 : 9-17), and ARV-2 (Sanchez- 
Pescador, R. et al. [1985] Science 227:484-492). 
Although the subject invention describes the sequence 
from one HTLV-III isolate, the appropriate envelope 
regions of any HTLV-III isolate can be produced using 
the procedures described herein for preparing R10, 
PB1, 590, and KH1 . The HTLV-III proteins from 
different viral isolates can be used in vaccine 
preparations, as disclosed above, to protect 
against infections by different HTLV-III 
isolates. Further, a vaccine preparation can be 
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made using more than one recombinant . antigenic protein 
from more than one HTLV-II1 isolate to prqvide 
immunity and thus, give better protection against 
AIDS. 

Following are examples which illustrate the 
process of the invention, including the best mode. 
These examples should not be construed as limiting. 
All solvent mixture proportions are by volurne unless 
10 otherwise noted. 

Example l—Construction of plasmid pREV2. 2 

The pREV2.2 plasmid expression vector was con- 
structed from plasmid pBGl . Plasmid pBGl can be 

15 isolated from its E. coli host by well known proce- 

dures, e.g., using cleared lysate-isopycnic density 
gradient procedures, and the like. Like pBGl , pREV2.2 
expresses inserted genes behind the E. coli promoter. 
The differences between pBGl and pREV2.2 are the 

20 following: 

1. pREV2.2 lacks a functional replication of 
plasmid ( rop ) Drotein.. 

2. pREV2.2 has the trp A transcription terminator- 
inserted into the Aatll site. This sequence 

25 insures transcription termination of over- 

expressed genes. 
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3. pREV2.2 has genes to provide resistance to 
ampicillin and chloramphenicol , whereas pBGl 
provides resistance only to ampicillin. 

4. pREV2.2 contains a sequence encoding sites for 
several restriction endonucleases . 

The following procedures* shown in Figure 1 of 
the drawings , were used to make each of the four 
changes listed above: 

la. 5 ug of plasmid pBGl was restricted with Ndel 
which gives two fragments of approximately 
2160 and 3440 base pairs, 
lb. 0.1 ug of DNA from the digestion mixture, after 
inactivation of the Nde l , was treated with 
T4 DNA ligase under conditions that favor 
intramolecular ligation (200 yl reaction 
volume using standard T4 ligase reaction 
conditions [New England Biolabs, Beverly, 
MA]). Intramolecular ligation of the 3440 
base pair fragment gave an ampicillin resistant 
plasmid. The ligation mixture was transformed 
into the recipient strain E. coli JM103 
(available from New England Biolabs) and 
ampicillin resistant clones were selected 
by standard procedures. 
1c. The product plasmid, pBGlAN, where the 2160 
base pair Nde l fragment is deleted from pBGl , 
was selected by preparing plasmid from 
ampicillin resistant clones and determining 
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the restriction digestion patterns withNdel 
and Sail (product fragments approximately 1790 
and 1650). This deletion inactivates the rop 
gene that controls plasmid replication. 

5 ug of pBGUN was then digested with EcoRI and 
Bell and the larger, fragment, approximately 2455 
base pairs, was isolated. 

A synthetic double stranded fragment was prepared 
by the procedure of Itakura et al. (Itakura, K. , 
Rossi, J.J. and Wallace, R.B. [1984] Ann. Rev. 
Biochem. 53:323-356, and references therein) with 
the structure shown in Table 1. This fragment 
has Bell and EcoRI sticky ends and contains recog- 
nition sequences for several restriction endonu- 
cleases. 

0.1 ug of the 2455 base pair EcoRI -Bell fragment 
and 0.01 yg of the synthetic fragment were joined 
with T4 DNA ligase and competent cells of strain 
JM103 were transformed. Cells harboring the 
recombinant plasmid, where the synthetic fragment 
was inserted into pBGlAN between the Bel l and EcoRI 
sites, were selected by digestion of th^ plasmid 
with Hpa l and Eco RI . The diagnostic fragment sizes 
are approximately 2355 and 200 base pairs. This 
plasmid is called pREVl. 

5 pg of pREVl were digested with Aatll, which 
cleaves uniquely. 

The double stranded fragment shown in Table 2 was 
synthesized. This fragment has Aatll sticky ends 
and contains the tro A transcription termination 
sequence . 

0.1 yg of Aatll digested pREVl was ligated with 
0.01 yg of the synthetic fragment in a volume 
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of 20 »l using T4 UNA ligase. 

2g. Cells of strain JM103, made competent, were tram 
formed and ampicillin resistant clones selected. 

2h. Using a Kpnl , EcoRI double restriction digest 
of plasmid isolated from selected colonies, 
a cell containing the correct construction 
was isolated. The sizes of the Kpnl , EcoRI 
generated, fragments are approximately 2475 and 
80 base pairs. This plasmid is called pREVITT 
and contains the troA transcription terminator. 

3a. 5 ug of p REV ITT , prepared as disclosed above 
(by standard methods) was cleaved with Ndel 
and Xmn I and the approximately 850 base pair 
fragment was isolated. 

3b. 5 ug of plasmid pBR325 (BRL, Gaithers- 

burg, MD) , which contains the genes conferring 
resistance to chloramphenicol as well as to 
ampicillin and tetracycline, was cleaved with 
Bel l and the ends blunted with Klenow polymerase 
and deoxynucleot ides. After inactivating the 
enzyme, the mixture was treated with Ndel and 
the approximately 3185 base pair fragment 
was isolated. This fragment contains the 
genes for chloramphenicol and ampicillin 
resistance and the origin of replication. 

3c. 0.1 gg of the Ndel -XmnI fragment from o REV ITT 
and the Ndel -Bell fragment from pBR325 were 
ligated in 20 yl with T4 DNA ligase and the 
mixture used to transform comoetent cells of 
strain JM103. Cells resistant to both ampi- 
cillin and chloramphenicol were selected. 
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3d. Using an EcoR I and Ndel double digest of 

plasmid from selected clones, a plasmid was ■ 
selected giving fragment sizes of approxi- 
mately 2480, 1145, and 410 base pairs.* This 
5 is called plasmid pREVlTT/chl and has genes 

for resistance to both ampicillin and chloram- 
phenicol. 

4a. A double stranded fragment shown in Table 3 
10 was synthesized. This fragment, with a blunt 

end and an Sst I sticky end, contains recogni- 
tion sequences for several restriction enzyme 
sites. 

15 4b. 5 ug of pREVlTT/chl was cleaved with Nrul 

(which cleaves about 20 nucleotides from the 
Bel l site) and Sst I (which cleaves within the 
multiple cloning site). The larger fragment, 
approximately 3990 base pairs , was isolated 

20 from an agarose gel'. 

4c. 0.1 yg of the Nrul -SstI fragment from pREVlTT/chl 
and 0.01 ug of the synthetic fragment were 
treated with T4 DNA ligase in a volume of 
20 yl. 

25 4d. This mixture was transformed into strain JM103 

and ampicillin resistant clones were selected. 
4e. Plasmid was purified from several cldnes and 

screened by digestion with Mlul or Clal. 

Recombinant clones with the new multiple cloning. 
30 site will give one fragment when digested with - 

either of these enzymes , because each cleaves 

the plasmid once. 
4f . The sequence of the multiple cloning site was 

verified. This was done by restricting the 
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plasmid with Hpal and PvuII and isolating 
the 1395 base pair fragment, cloning it into 
the Sma l site of mpl8 and sequencing it by 
dideoxynucleotide sequencing using standard 
methods . 

4g. This plasmid, called pREV2.2 is diagrammed 
in Figure 2 of the drawings. 

Example 2~C«natruction of and expression from pRlO. 

Plasmid pRlO, which contains approximately 1275 
base pairs of DNA encoding the HTLV-III env gene from 
essentially the Kpnl site to the Bglll site, and from 
which is synthesized an approximately 95 kO fusion 
protein containing this portion of the g P 120 envelope 
protein, can be constructed as follows: 

1. Synthesizing the DNA with the sequence shown 
in Table 4. This DNA fragment can be synthe- 
sized by standard methods (Itakura, et al. , 
supra, and references therein) and encodes 
a portion of g P 120. It has a blunt end on the 
5-' end and an end which will ligate with a 
BamHI overhang on the 3' end. 

2. Restricting 5 yg. of plasmid pBGl with Bell, 
filling in the overhanging ends with Klenow 
polymerase and deoxyribonucleotide triphos- 
phates (dNTPs) , restricting this fragment 
with BamHI and isolating the large fragment , - 
approximately 8.9 kb, from an agarose gel. 

3. Ligating 0.1 yg of the fragment in Table 4 with 
0.1 yg of the pBGl fragment in a volume of 

20 yl using T4 DNA ligase, transforming the 
ligation mixture into competent cell strain 
SG20251 (Gottesman, S. , Haloern, E. and Trisler, 
P [1981] Journal of Bacteriology 148:265-273), 
and selecting ampicillin resistant transf ormants 
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4. Selecting, using the Ahalll restriction 
pattern of purified plasmid, cells harboring the 
recombinant plasmid with the synthesized \ 
fragment in the orientation whereby, the : frag- 
ment blunt end ligated to the pBGl fragment 
filled-in- Bell end and the BamHI overhanging 
ends ligated together. Ahalll digestion of the 
proper plasmid gives fragment lengths of 
approximately 5300, 3170, 690 , 640 , 330, and 

20 base pairs. 

5, When the strain harboring this recombinant plas- 
mid is grown in 2% medium (2% yeast; extract, 
bactotryptone , casamino acids (Dif co , Detroit, 
MI), 0.2% potassium monobasic, 0,2% potassium 
dibasic, and 0.2% sodium dibasic) containing 

50 ug/ml ampicillin and the total complement 
of cellular proteins electrophoresed on an 
SDS-polyacrylamide gel, a prominent protein of 
approximately 95 kD can be visualized by either 
coomassie blue staining or by western blot 
analysis using as probe selected sera from 
AIDS, ARC, or HTLV-III infected individuals. 

Example 3- -Purification of recombinant protein containing 
HTLV-III envelope sequences from plasmid pRIO 

1. Growth of cells: 

Cells were grown in a 10 liter volume in a Chemap 
fermentor (Chemapec, Woodbury, NY) in 2% medium. Fer- 
mentation temperature was 37°C, the pH was 6 .8, and air 
was provided at 1 wm. Plasmid selection was provided *' ' 
by 50 ug/ml ampicillin. Typical cell yield (wet weight') 
is 30 g/1. 

2. Cell lysis: 

50 g, wet cell weight, of E. coli containing the 
recombinant HTLV-III envelope fusion protein were 
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resuspended in a final volume of 100 ml in 50 mM 
Tris-Cl pH 8.0, 5 mM potassium e thy lene diamine tetr a - 
acetic acid (KEDTA) , 5 mM dithiothreitol (DTT) , 15 
mM e-mercaptoethanol, 0.52 Triton X-100, and 5 mM 
phenylmethylsulfonyl fluoride (PMSF) . 300 mg lysozyme 
was added and the suspension incubated for 30 min at 
room temperature. 

This material was lysed using a BEAD -BEATER 
(Biospec Products, Bartlesville , OK) containing an 
equal volumn of 0.1-0.15 gm glass beads. The lysis 
was done for 6 min at room temperature in 1 min 
intervals. The liquid was separated from the beads and 
centrifuged for 2.5 hr at 20,000 xg. The supernatant 
was removed and the pellet dissolved in 100 ml 8 M 
15 urea, 20 mM Tris-Cl pH 8.0, 5 mM DTT , . 1 5 mM e-mercap- 

toethanol, 5 mM PMSF, and 1 mM KEDTA. The pellet was 
solubilized using a polytron homogenizer (Beckman, 
Berkeley, CA) and centrifuged at 20,000 xg for 2 hr. 
3. Diethylaminoethyl (DEAE) chromatography: 

Supernatant was loaded onto a 550 ml column 
(5 cm x 28 cm) packed with DEAE Fast Flow Sepharose 
(Pharmacia, Piscataway, NJ) equilibrated in 8 M urea, 
20 mM Tris-Cl pH 8.0, 15 mM e-mercaptoethanol , and 
1 mM KEDTA at room temperature. The column was washed 
with 1.5 liters equilibration buffer, and the protein 
eluted with a 5.0 liter linear gradient from 0-0.8 M 
NaCl in equilibration buffer. The HTLV-I1I protein 
eluted at 0.2 M NaCl and was assayed using SDS-poly- 
acrylamide electrophoresis and following the prominent 
30 protein at approximately 95 kD. 

The fractions containing the HTLV-III protein 
were pooled and the protein concentrated to 10 ml 
using a stressed cell positive pressure concentrator 
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(Amicon, Danvers, MA) fitted with a 10,000 MW cut-off 
membrane (YM-10, Amicon). The concentrate was loaded !V 
onto a 500 ml column (2.5 cm x 100 cm) packed with 
superfine sephacryl S-300 (Pharmacia) equilibrated in L ' V 
8 M urea, 20 mM Tris-Cl, pH 8.0, 15 mM 6-mercaptoethanol , 
and 1 mM KEDTA.. The column was eluted with equilibra- 
tion buffer at room temperature. A flow rate of 0.5 
ml/min was maintained. The HTLV-III protein eluted at 
approximately 402 of the column volume. 

Example A— Construct ion of and expression from olasmid 
pPBl ~ ' ■ ' ■ ~ 

Plasmid pPBl, which contains approximate iy : 540 
base pairs of DNA encoding essentially the Hliv-III env 
15 gene from the PvuII site to the Bglll site, and from" "■' 

which is synthesized an approximately 26 kD fusion 
protein containing this portion of the gp!20 envelope 
protein can be constructed as follows: 

1. Synthesizing the DNA with the sequence shown in 
20 Table 5: This DNA fragment can be synthesized by 

standard methods and encodes a portion of gpl20. 
It has a blunt end on the 5' end and an end which 
will ligate with a Bam HI overhang on the 3' end. 

2. Restricting 5 wg plasmid pREV2.2 with EcoRV and 
BamHI and isolating the large fragment, approximately, 
4 kD, from an agarose gel. 

3. Ligating 0.1 pg of the fragment in Table 5 with 0.1 
Ug of the pREV2.2 fragment in a volume of 20 yl 
using T4 DNA ligase, transforming the ligation mix- 
ture into competent cell strain SG20251 , and : selecting 
ampicillin resistant transformants . 

4. Using the Ahalll restriction pattern of purified 
plasmid, selecting cells harboring the recombinant 
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olasmid with the synthesized fragment in the 
orientation whereby the fragment blunt end ligated 
to the REV2.2 EcoRV end and the BamHI ^overhanging ends 
ligated together. Ahalll digestion of the proper 
plasmid gives fragment lengths of approximately 
1210, 1020, 750., 690, 500, 340, and 20 base pairs. 
When the strain harboring this recombinant plasmid 
is grown in 22 medium containing 50 pg/ml ampicillin 
and the total complement of cellular proteins 
electrophoresed on an SDS-polyacrylamide gel, a 
protein of approximately 26 kD can be visualized 
by either coomassie blue staining or by western 
blot analysis using as probe selected sera from 
AIDS, ARC, or HTLV-I1I infected individuals. 

Example 5- -Purification of recombinant pro tein con- 
taining HTLV-III envelope sequences from plasmid pPBI 

1. Growth of cells: 

Cells were grown in a 10 liter volume in a Chemap 
fermentor in 2% medium. Fermentation temperature was 
37 °C, the pH was 6.8, and air was provided at 1 vvm. 
Plasmid selection was provided by 50 ug/ml ampicillin 
and 20 ;.g/ml chloramohenicol . Typical cell yield 
(wet weight) was 30 g/1. 

2. Cell Lysis: 

50 g, wet cell weight, of E. coli containing the 
recombinant HTLV-III envelope fusion protein were 
resuspended in a final volume of 100 ml in 50 mM 
Tris-Cl pH 8.0, 5 mM KEDTA, 5 mM DTT, 15 mM B-mercap- 
toethanol, 0.5% Triton X-100, and 5 mM PMSF. 300 mg 
lysozyme was added and the suspension incubated for 30 
min at room temperature. 

This material was lysed using a BEAD-BEATER 
(Biospec Products, Bartlesville , OK) containing an 
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equal volume of 0.1-0.15 um glass beads. The lysis 
was done for 6 min at room temperature in .1 min 
intervals. The liquid was separated from the beads 
and centrifuged for 2.5 hr at 20,000 xg. The super- 
natant was removed and the pellet was resuspended in 
100 ml 6 M guanidine-hydrochloride , 20 raM Tris-Cl pH 
8.0, 5 mM DTT, 15 mM e-mercaptoethanol , 5 mid PMSF, and 

1 mM KEDTA. The oellet was solubilized using a 
polytron homogenizer and centrifuged at 20,000 xg 
for 2 hr. 

The supemate (90 ml) was dialysed against k liters 
of 8 M urea, 20 mM Tris-Cl, pH 8.0, 1 mM EDTA, and 
15 mM B-mercantoethanol. Dialysis was done each time 
for 8 hr or longer with three changes of buffer. 
Spectraphor dialysis tubing (S/P, McGraw Park, IL) 
with a 3.5 kD MW cut-off was used. 
3. CM chromatography 

The dialysate was loaded onto a 550 ml column 
(5 cm x 28 cm) packed with CM Fast Flow Sepharose 
(Pharmacia) equilibrated in 8 M urea, 10 mM potassium 
phosphate pH 7.0, 15 mM S-mercaptoethanol , and 1 mM 
KEDTA at room temperature. The column was washed with 

2 liters equilibration buffer, and the protein eluted 
with a 5.0 liter linear gradient from 0-0 . 4 M NaCl. 
The HTLV-III protein (26 kD) eluted at approximately 
0.2 M NaCl as assayed by SDS-polyacrylamide gel 
electrophoresis. 

Example 6 — Construction of and expression from plasmid '•■ 
p590 "£ 

Plasmid p590, which contains approximately 1055 
base pairs of DNA encoding essentially the HTLV-III 
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env gene from the PvuII site to the Hindlll site, and 
from which is synthesized an approximately 86 kD 
fusion protein containing this portion of the gpl60 
envelope protein can be constructed as follows: 
1: Synthesizing the DNA with the sequence shown in 

Table 6: This DNA fragment can be synthesized by. 

standard methods and encodes a oortion of gpl60. 

It has a blunt end on the 5 ' end and an end which 

will ligate with a Hindlll overhang on the 3' end. 

2. Restricting 5 yg plasmid pREV2.2 with EcoRV and 
Hindlll and isolating the large fragment, approxi- 
mately 4 kD, from an agarose gel. 

3. Ligating 0.1 yg of the fragment in Table 6 with 
0.1 ug of the pREV2.2 fragment in- a volume of 20 
ml using T4 DNA ligase , transforming the ligation 
mixture into competent cell strain SG20251. and 
selecting ampicillin resistant transformants . 

4. Using the Aha II I restriction pattern of purified 
plasmid, selectiong cells harboring the recombinant 
plasmid with the synthesized fragment in the ori- 
entation whereby the fragment blunt end ligated to 
the pREV2.2 EcoRV end and the Hindlll overhanging 
ends ligated together. Ahalll digestion of the 
proper plasmid gives fragment lengths of approxi- 
mately 1740, 1020, 750. 690, 500, 340 , and 20. 

5. 5 yg of plasmid, purified from this strain, is 
restricted with Ndel and Smal. The approximately 
1425 base pair fragment is isolated from an agarose 
gel. The 1505 base pair fragment is fused to the 
DNA encoding the segment of gpl60. 

6. 5 yg of pBGlOl is restricted with BamHI , the over- 
hanging ends filled in with Klenow polymerase and 
dNTPs, and then restricted with Ndel. The approxi- 
mately 6.5 kD fragment is isolated from an agarose 
gel. 
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7. Ligating 0.1 ug of the Ndel-Smal fragment/with 0.1 
Ug of the pBGl fragment using T4 DNA ligase , trans- 
forming the ligation mixture into competent cell 
strain SG20251, and selecting ampicillin resistant 
transf ormants . 

8. Using the Ahalll restriction pattern of purified 
plasmid, selecting cells harboring the recombinant 
plasmid with the synthesized fragment in the ori- 
entation whereby the fragment blunt Smal end ligated 
to the Bam Hi /f illed-in end and the Ndel overhanging 
ends ligated together. Ahalll digestion of the 
proper plasmid gives fragment lengths of approxi- 
mately 5900 t 1020, 690, 430, and 20 base pairs. 

9. When the strain harboring this recombinant plasmid 
15 is grown in 2% medium containing 50 yg/ml ampicillin 

and the total complement of cellular proteins 
electrophoresed on an SDS-polyacrylamide gel , a 
protein of approximately 86 kD can be visualized 
by either coomassie blue staining or by western blot 
analysis using as probe selected sera from AIDS, ARC, 
or HTLV-III infected individuals. 
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Example 7- -Purification of recombinant protein con - 
taining HTLV-III envelope sequences from plasmid d590 

25 1. Growth of cells: 

Cells were grown in a 10 liter volume in a Chemap 
fermentor in 2% medium. Fermentation temperature was 
37 °C, the -pH was 6.8, and air was provided at 1 wm. 
Plasmid selection was provided by 50 yg/ml ampicillin., 

30 Typical cell yield (wet weight) is 30 g/1. 

2. Cell Lysis: 

50 g, wet cell weight, of E. coli containing the 
recombinant HTLV-III envelope fusion protein were 
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resuspended in a final volume of 100 ml in 50 mM 
Tris-Cl pH 8.0, 5 mM KEDTA , 5 mM DTT, 15 mM p-mercapto- 
ethanol. 0.5% Triton X-100, and 5 mM PMSF. 300 mg 
lysozyme was added and the suspension incubated for 30 
min at room temoerature. * 

This material was lysed using a Bead-Beater 
containing 0.1-0.15 mm glass beads. The lysis was 
done for 6 min at room temoerature in 1 min intervals. 
The liquid was separated from the beads and centrifuged 
for 2.5 hr at 20,000 xg. The supernatant was removed 
and the pellet was resuspended in 100 ml 6 M guanidine- 
hydrochloride, 20 mM Tris-Cl pH 8.0, 5 mM DTT, 15 mM 
B-mercaptoethanol , 5 mM PMSF, and 1 mM KEDTA. The 
pellet was solubilized using a polytron homogenizer and 
centrifuged at 20,000 xg for 2 hr. 

The supernate (90 ml) was dialysed against U liters 
of 8 M urea, 20 mM Tris-Cl. pH 8.0, 1 mM EDTA, and 15 
mM .--mercaptoethanol. Dialysis was done each time for 
8 hr or longer with three changes of buffer. Spectra- 
phor dialysis tubing with a 3.5 kD MW cut-off was 
used. 

3. Die-thylaminoethyl (DEAE) chromatography 

Dialysate was loaded onto a 550 ml column (5 cm 
x 28 cm) packed with DEAE Fast Flow Sepharose (Pharm- 
macia) equilibrated in 8 M urea, 20 mM Tris-Cl pH 8.0, 
15 mM B-mercaptoethanol , and 1 mM KEDTA at room tem- 
perature. The column was washed with 1.5 liters 
equilibration buffer, and the protein eluted with a 
5.0 liter linear gradient from 0-0.8 M NaCl in equili- 
bration buffer. The HTLV-III protein eluted at 0.4 M 
NaCl and was assayed using SDS-polyacrylamide electro- 
phoresis and following the prominent protein at 
approximately 86 kD. 
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The fractions containing the HTLV-Ili protein 
were pooled and the protein concentrated to 10 ml using 
a stressed cell positive pressure concentrator (Amicon) 
fitted with a 10,000 MW cut-off membrane (YM-10, 
Amicon) . The concentrate was loaded onto a 500 ml 
column (2.5 cm x 100 cm) packed with superfine 
sephacryl S-300 (Pharmacia) equilibrated in 8 M urea, 
20 mM Tris-Cl, pH 8.0, 15 mM B-mercaptoethanol , and 
1 mM KEDTA. The column was eluted with equilibration 
buffer at room temperature. A flow rate of 0.5 ml/inin 
was maintained. The HTLV-III protein eluted at 
approximately 40% of the column volume. 

! 

Example 8- -Construction of and expression from plasmid; 
pKHl 

Plasmid pKHl, which contains approximately 1830 
base pairs of DNA encoding essentially the HTLV-III 
env gene from the Kpnl site to the Hitidlll site, and 
from which is synthesized an approximately 70 kD 
fusion protein containing this portion of thfe gpl60 
envelope protein, can be constructed as follows: 

1. Synthesizing the DNA with the sequence shown 
in Table 7: This DNA fragment can be synthe- 
sized by standard methods and encodes a portion 
of gpl60. It has a blunt end on the 5 r end 
and. an end which will ligate with a Hindlll 

— — -"jii^' 

overhang on the 3' end. 

2. Restricting 5 yg plasmid pREV2.2 with Mlul , 
treating the DNA with Klenow polymerase to 
blunt the ends, treating with Hin dlll and 
isolating the large fragment, approximately 
5 kD, from an agarose gel. 

3. Ligating 0.1 yg of the fragment in Table 7 
with 0.1 yg of the pREV 2.2 fragment in a 
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volume of -20 yl using T4 DNA ligase , trans- 
forming the ligation mixture into competent 
cell strain CAG629, and selecting ampicillin 
resistant transf ormants . 
4. Using the Ahalll restriction pattern of puri- 
fied plasmid, selecting cells harboring the 
recombinant plasmid with the synthesized 
fragment in the orientation whereby the frag- 
ment blunt end ligated to the REV2.2 Mlul 
end and the Hindi II overhanging ends ligated 
together. Ahalll digestion of the proper 
plasmid gives fragment lengths of approximately 
1730, 1020, 750, 690, 640, 600, 340, and 20 
base pairs. When the strain harboring this 
recombinant plasmid is grown in 21 medium 
containing 50 ug/ral ampicillin and the total 
complement of cellular proteins electrophoresed 
on an SDS-polyacrylamide gel , a protein of 
approximately 70 kD can be visualized by either 
Coomassie blue staining or by Western Blot 
analysis using as probe selected sera from 
AIDS, ARC, or HTLV-III infected individuals. 

Example 9— Purif ication of recombinant protein containing 
HT LV-III envelope sequences from pla smid pKHl 
1. Growth of cells: 

Cells were grown in a 10 liter volume in a Chemap 
fermenter in 2% medium. Fermentation temperature 
was 32°C, the pH was 6.8, and air was provided at 
1 wm. Plasmid selection was provided by 50 ug/ml 
ampicillin. Typical cell yield (wet weight) is 30 
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2. Cell lysis: 

50 g, wet cell weight, of E. coli containing the 
recombinant HTLV-III envelope fusion protein were 
resuspended in a final volume of 100 ml' in 50 mM 
5 Tris-Cl pH 8.0, 5 mM KEDTA, 5 mM dithiothreitol (DTT) , 

15 mM B-mercaptoethanol, 0.5% Triton X-JOO and 5 mM 
PMSF. 300 mg lysozyme was added and the suspension 
incubated for 30 min at room temperature. 

This material was lysed using a BEAD-BEATER™ 

10 (Biospec Products) containing an equal volume of 0.1- 

0.15 urn glass beads. The lysis was done for 6 min 
at room temperature in 1 min intervals. The liquid 
was separated from the beads and centrifuged for 2.5 
hr at 20 ,000 xg. The supernatant was removed and the; 

15 pellet dissolved in 100 ml 8 M urea, 20 mti'Tris-Cl 

pH 8.0, 5 mM DTT, 15 mM B-mercaptoethanol, 5 mM PMSF, 
and 1 mM KEDTA. The pellet was solubilized using a 
polytron homogenizer (Beckman, Berkeley , CA) and 
centrifuged at 20,000 xg for 2 hr. 

20 3. DEAE chromatography: 

Supernatant was loaded onto a 550 ml column (5 cm 
x 28 cm) packed with DEAE Fast Flow Sepharose '(Pharmacia) 
equilibrated in 8 M urea, 20 mM Tris-Cl pH 8.0, .15 
mM B-mercaptoethanol, and 1 mM KEDTA at room tempera- 

25 ture. The column was washed with 1.5 liters equilibra- 

tion buffer, and the protein eluted with a 5.0 liter 
linear gradient from 0-0.8 M NaCl in equilibration 
buffer. The HTLV-III protein eluted at 0.2 M NaCl 
and was assayed using SDS-polyacrylamide electrophoresis 

30 and following the protein at approximately 70 kD. 

The fractions containing the HTLV-III protein 
were pooled and the protein concentrated to 10 ml 
using a stressed cell positive pressure concentrator 
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(Amicon) fitted with a 10,000 MW cut-off membrane 
(YM-10, Amicon). The concentrate was loaded onto a 
500 ml column (2.5 cm x 100 cm) packed with superfine 
sephacryl S-300 (Pharmacia) equilibrated in 8 M urea, 
20 mM Tris-Cl, pH 8.0, 15 mM B-mercaptoethanol , and 
1 mM KEDTA. The column was eluted with equilibration 
buffer at room temperature. A flow rate of 0.5 ml/min 
was maintained. The HTLV-III protein eluted at 
approximately 40% of the column volume. 
4. SDS-polyacrylamide electrophoresis: 

The fractions containing KHl were pooled and the 
protein concentrated using a stressed cell positive 
pressure concentrator fitted with a 10,000 MW cutoff 
membrane. 2 mg of protein was mixed -with loading 
buffers and electrophoresed through a preparative 
SDS-polyacrylamide gel (40 cm x 20 cm x 4 mm) as 
described by M.W. Hunkapiller, E. Lujan, F. Ostrander, 
and L.E. Hood, Methods in Enzymology 91:227-236 (1983). 
The 70 kD HTLV-III protein was visualized with 
Coomassie blue stain and eluted from the gel as 
described. The protein can be removed from the SDS 
by precipitation with acetone (Dynan, W.J. , Jendrisak, 
J.J., Hager, D.A. and Burgess, R.R. [1981] J. Biol. 
Chem. 256:5860-5865). 



0255190 



-35- 
Table 1 

5 ' GATCAAGCTTCTGCAGTCGACGCATGCGGATCCGGTACCCGGGAGCTCG 3 ' 
TTCGAAGACGTCAGCTGCGTACGCCTAGGCCATGGGCCCTCGAGCTTAA 



Table 2 

5 1 CGGTACCAGCCCGCCTAATGAGCGGGCTTTTTTTTGACGT 3 

TGCAGCCATGGTCGGGCGGATTACTCGCCCGAAAAAAAAC 



Table 3 



Mlul EcoRV Clal BamHI Sail Hindlll Smal 

CGAACGCGTGGCCGATATCATCGATGGATCCGTCGACAAGCTTCCCGGGAGCT 
GCTTGCGCACCGGCTATAGTAGCTACCTAGGCAGCTGTTCGAAGGGCCC 
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Table 4 



5 ' AATTCCCTGTGTGGAAGGAAGCA 
TTAAGGGACACACCTTCCTTCGT 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACAT 
TGGTGGTGAGATAAAACACGTAGTCTACGATTTCGTATACTATGTCTCCATGTA 

AATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTA 
TTACAAACCCGGTGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCAT 

GTATTGGTAAATGTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAA 
CATAACCATTTACACTGTCTTTTAAAATTGTACACCTTTTTACTGTACCATCTT 

CAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTA 
GTCTACGTACTCCTATATTAGTCAAATACCCTAGTTTCGGATTTCGGTACACAT 

AAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACT 
TTTAATTGGGGTGAGACACAATCAAATTTCACGTGACTAAACTTCTTACTATGA 

AATACCAATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAAC 
TOATGG^ATCATCATCGCCCTCTTACTATTACCTCTTTCCTCT 

TGrTCTTTCAATATCAGCACAAGCATAAGAGGTAAGGTGCAGAAAGAATATGCA 
ACGAGAAAGTTATAGTCGTGTTCGTATTCTCCATTCCACGTCTTTCTTATACGT 

TTTTTTTATAAACTTGATATAATACCAATAGATAATGATACTACCAGCTATACG 
AAAAAAATATTTGAACTATATTATGGTTATCTATTACTATGATGGTCGATATGC 

TTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCTTT 
AACTGTTCAACATTGTGGAGTCAGTAATGTGTCCGGACAGGTTTCCATAGGAAA 

GAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGT 
CTCGGTTAAGGGTATGTAATAACACGGGGCCGACCAAAACGCTAAGATTTTACA 

AATAATAAGACGTTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAA 
TTATTATTCTGCAAGTTACCTTGTCCTGGTACATGTTTACAGTCGTGTCATGTT 

TGTACACATGGAATTAGGCCAGTAGTATCAACTCAACTGCTGTTAAATGGCAGT 
ACATGTGTACCTTAATCCGGTCATCATAGTTGAGTTGACGACAATTTACCGTCA 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCTCGTAAACAA 
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Table U (cont.) 



ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTqGTGTAACATTGTAATCATCTGGT, 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCdAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCGTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAAl'TCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATT^AGTTGTGTTGAG 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTGAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTQCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACGTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTOCTCTACCACCATTATCGTTG 

AATGAGTCCGA 3' 
TTACTCAGGCTCTAG 
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Table 5 



5 ' CTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
GACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTGGTCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGA 3'. 
TTACTCAGGCTCTAG 
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5 ' 5T^ CC ^ TCTGTAG; ^ TT ^ T TGTACftA(3ACCeAAC 
GACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 



/ 



-AO- 



Table 6 (cont.) 



GATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACT 
CTAGTTGTCGAGGACCCCTAAACCCCAACGAGACCTTTTGAGTAAACGTGGTGA 

GCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAAT 
CGACACGGAACCTTACGATCAACCTCATTATTTAGAGACCTTGTCTAAACCTTA 

AACATGACCTGGATGGAGTGGGACAGAGAAATTAACAATTACACA 3 1 

TTGTACTGGACCTACCTCACCCTGTCTCTTTAATTGTTAATGTGTTCGA 
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Table 7 

5' AATTCCCTGTGTGGAAGGAAGCA 
TTAAGGGACACACCTTCCTTCGT 

accaccactctattttgtgcatcagatgctaaagcatatgatkcagaggtacat 
tggtggtgagataaaacacgtagtctacgatttcgtatactatg^jctccatgta 

aatgtttgggccacacatgcctgtgtacccacagaccccaacccacaagaagta 
ttacaaacccggtgtgtacggacacatgggtgtctggggtogggtgttcttcat 

gtattggtaaatgtgacagaaaattttaacatgtggaaaaatgaca*ggtagaa • 
cataaccatttacjvctgtcttttaaaattgtacacctta^actgtaccatctt 

cagatgcatgaggatataatcagtttatgggatcaaagcctaaagccatgtgta 
gtctacgtactcctatattagtcaaataccctagtttcggatttcggtacacat 

aaattaaccccactctgtgttagtttaaagtgcactgattt6aagaatgatact 
tttaattggggtgagacacaatcaaatttcacgtgactaaacttgttactatga 

aataccaatagtagtagcgggagaatgataatggagaaaggagagataaaaaac 
ttatggttatc^tcatcgccctcttactattacctctttcctctcta'ttttttg 

tgctctttcaatatcagcacaagcataagaggtaaggtgcagaaagaatatgca 
acgagaaagttatagtcgtgttcgtattctccattccacgtc^^jCttatacgt 

tttttttataaacttgatataataccaatagataatgatactacgagctatacg ' 
aaaaaaatatttgaactatattatggttatctattactatgatggtcgatatgc 

ttgacaagttgtaacacctcagtcattacacaggcctgtccaaaggtatccttt 
aactgttcaacattgtggagtcagtaatgtgtccggacaggtttccataggaaa 

gagccaattcccatacattattgtgccccggctggttttgcgattctaaaatgt 
ctcggttaagggtatgtaataacacggggccgaccaaaacgctaagattttaca 

aataataagacgttcaatggaacaggaccAtgtacaaatgtcagcacagtacaa 
ttattattctgcaagttaccttgtcctggtacatgtttacagtcgtgtcatgtt 

tgtacacatggaattaggccagtagtatcaactcaactgctgttaaatggcagt 
acatgtgtaccttaatccggtcatcatagttgagttgacgacaatttaccgtca : 

ctagcagaagaagaggtagtaAttagatctgccaatttcacagacaatgctaaa 
gatcgtcttcttctccatcattaatctagacggttaaagtgtctgttacgattt \ 

accataatagtacagctgaaccaatctgtagaaattaattg^pacaagacccaac*ii : 
tggtattatcatgtcgacttggttagacatctt!?aattaacatgttctgggttg 

aacaatacaagaaaaagtatccgtatccagagaggaccagggagagcatttgtt : 
ttgttatgttctttttcataggcataggtctctcctaatccctcpcgtaaacaa 

acaataggaaaaataggaaatatgagacaagcacattgtaacAttagtagagca 

TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAA^CATCTCGT 



Table 7 (cont.) 



AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTAGTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGA 
TTACTCAGGCTCTAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACCTCT 

AGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCC 
TCACTTAATATATTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGG 

ACCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGA 
TGGTTCCGTTTCTCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCT 

GCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCA 
CGAAACAAGGAACCCAAGAACCCTCGTCGTCCTTCGTGATACCCGCGTCGCAGT 

ATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAG 
TACTGCGACTGCCATGTCCGGTCTGTTAATAACAGACCATATCACGTCGTCGTC 

AACAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTC 
TTGTTAAACGACTCCCGATAACTCCGCGTTGTCGTAGACAACGTTGAGTGTCAG 

TGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAG 
ACCCCGTAGTTCGTCGAGGTCCGTTCTTAGGACCGACACCTTTCTATGGATTTC 

GATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACT 
CTAGTTGTCGAGGACCCCTAAACCCCAACGAGACCTTTTGAGTAAACGTGGTGA 

GCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAAT 
CGACACGGAACCTTACGATCAACCTCATTATTTAGAGACCTTGTCTAAACCTTA 

AACATGACCTGGATGGAGTGGGACAGAGAAATTAACAATTACACA 3 ' 

TTGTACTGGACCTACCTCACCCTGTCTCTTTAATTGTTAATGTGTTCGA 
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Table 8 

Amino acid sequence of fusion; protein RIO 

• MetLeuArg 

ProValGluThrProThrArgGluIleLysLysLeuAspGlyLeuTrpAiaPhe 

SerLeuAspArgGluAsnCysGlylleAspGlnPheProValTrpLysGluAla 

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluyalHis 

AsnValTrpAlaThrHisAlaCysValProThrAspProAsnProGlnGluVal 

ValLeuValAsnValThrGluABnPheAsnMetTrpLysAsnAspMetyalGlu 

GlnMetHisGluAsplleileSerLeuTrpAspGlnSerLeuLysPrpCysVal 

LysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThr 

AsnThrAsnSerSerSerGlyArgMetlleMetGluLysGlyGlMlieLysAsn 

CysSerPheAsnlleSerThrSerlleArgGlyLysValGlnLysGluTyrAla 

PhePheTyrLysLeuAspIlelleProlleAspAsnAspThrTbrSerTyrThr 

Le uTh rSe rCy sAsnTh r Se rVal 1 1 eTh r GlnAlaCysProLy sValSe rPhe 

GluProIleProlleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCys 

AsnAsnLysThrPheAsnGlyThrGlyProCysThrAsnValSerThrValGln 

CysThrHisGlylleArgProValValSerThrGlnLeuLeuLeuAsnGlySer 

LeuAlaGluGluGluValVallleArgSerAlaAsnPheThrAspAsnAlaLys 

ThrllelleValGlnLeuAsnGlnSerValGluIleAsnCysThrAEgProAsn 

AsnAsnThrArgLysSerlleArglleGlnArgGlyProGlyArgAlaPheVal 

ThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCysAsnlleSerArgAla 

LysTrpAsnAsnThrLeuLysGlnlleAspSerLysLeuArgGiuGinPheGly 

AsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGluIleVal 

ThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeu 

PheAsnSerThrTrpPheAsnSerThrTrpSerThrLysGlySerAsnAsnT.hr 

GluGlySerAspThrlleThrLeuProCysArglleLysGinllelleAsnMet 

TrpGlnGluValGlyLysAlaMetTyrAlaProProlleSerGlyGlnlleArg 

CysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnSerAsn 



table 8 (cont.) 
AsnGluSerGluIleHisArgSerValMetLeuTyrThrThrProAsnThrTrp 
ValAspAspIleThrValValThrHisValAlaGlnAspCysAsnHisAlaSer 
ValAspTrpGlnValValAlaAsnGlyAspValSerValGluLeuArgAspAla 
AspGlnGlnValValAlaThrGlyGlnGlyThrSerGlyThrLeuGlnValVal 
AsnProHisLeuTrpGlnProGlyGluGlyTyrLeuTyrGluLeuCysValThr 
AlaLysSerGlnThrGluCysAspIleTyrProLeuArgValGlylleArgSer 
ValAlaValLysGlyGluGlnPheLeuIleAsnHisLysProPheTyrPheThr 
GlyPheGlyArgHisGluAspAlaAspLeuArgGlyLysGlyPheAspAsnVal 
LeuMetValHisAspHisAlaLeuMetAspTrpIleGlyAlaAsnSerTyrArg 
ThrSerHisTyrProTyrAlaGluGluMetLeuAspTrpAlaAspGluHisGly 
IleValVallleAspGluThrAlaAlaValGlyPheAsnLeuSerLeuGlylle 
GlyPheGluAlaGlyAsnLysProLysGluLeuTyrSerGluGluAlaValAsn 
GlyGluThrGlnGlnAlaHisLeuGlnAlalleLysGluLeuIleAlaArgAsp 
LysAsnHisProSerValValMetTrpSerlleAlaAsnGluProAspThrArg 
ProGlnGlyAlaArgGluTyrPheAlaProLeuAlaGluAlaThrArgLysLeu 
AspProThrArgProIleThrCysValAsnValMetPheCysAspAlaHisThr 
AspThrlleSerAspLeuPheAspValLeuCysLeuAsnArgTyrTyrGlyTrp 
TyrValGlnSerGlyAspLeuGluThrAlaGluLysValLeuGluLysGluLeu 
LeuAlaTrpGlnGluLysLeuHisGlnProIlellelleThrGluTyrGlyVal 
AspTh rLeuAlaGlyLeuH isSe rMetTy rTh r AspMetT rpSe rGluGl uTy r 
GlnCysAlaTrpLeuAspMetTyrHisArgValPheAspArgValSerAlaVal 
ValGlyGluGlnValTrpAsnPheAlaAspPheAlaThrSerGlnGlylleLeu 
ArgValGlyGlyAsnLysLysGlyllePheThrArgAspArgLysProLysSer 
AlaAlaPheLeuLeuGlnLysArgTrpThrGlyMetAsnPheGlyGluLysPro 
GlnGlnGlyGlyLysGln 
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Table 8A 

Nucleotide sequence encoding fusion protein RIO 

ATGTTACGT 
TACAATGCA 

CCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTCGACGGCCTGTGGGCATTC 
GGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGACACCCGTAAG 

AGTCTGGATCGCGAAAACTGTGGAATTGATCAATTCCCTGTGTGGAAGGAAGCA 
TCAGACCTAGCGCTTTTGACACCTTAACTAGTTAAGGGACACACCTTCCTTCGT 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACAT 
TGGTGGTGAGATAAAACACGTAGTCTACGATTTCGTATACTATGTCTCCATGTA 

AATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTA 
TTACAAACCCGGTGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCAT 

GTATTGGTAAATGTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAA 

cataaccatttacactgtcttttaaaattgtacacctttttact6taccatctt 

cagatgcatgaggatataatcagtttatgggatcaaagcctaaagccatgtgta 
gtctacgtactcctatattagtcaaataccctagtttcggatttcggtacacat 

aaattaaccccactctgtgttagtttaaagtgcactgatttgaagaatgatact 
tttaattggggtgagacacaatcaaatttcacgtgactaaacttctfactatga 

aataccaatagtagtagcgggagaatgataatggagaaaggagagataaaaaac 
ttatggttatcatcatcgccctcttactattacctctttcctctctaitttttg 

tgctctttcaatatcagcacaagcataagaggtaaggtgcagaaagaatatgca 
acgagaaagttatagtcgtgttcgtattctccattccacgtctttcttatacgt 

tttttttataaacttgatataataccaatagataatgatActaccaggtatacg 

AAAAAAATATTTGAACTATATTATGGTTATCTATTACTATGATGGTCGATATGC 

ttgacaagttgtaacacctcagtcattacacaggcctgtccaaagg^atccttt 

AACTGTTCAACATTGTGGAGTCAGTAATGTGTCCGGACAGGTTTCCATAGGAAA 
GAGCCAATTCCCATACATTATTGTGCCCCGGCTGGi'TTTGCGATTCTAAAATGT 

ctcggttaagggtatgtaataacacggggccgaccaaaacgctaagattttaca 

aataataagacgttcaatggaacaggaccatgtagaaatgtcagcacagtacaa 
ttattattctgcaagttaccttgtcctggtacatgtttacagtcgtgtcatgtt 

tgtacacatggaattaggccagtagtatcaactgaactgctgttaaatggcagt 
acatgtgtaccttaatccggtcatcatagttgagttgacgacaatttaccgtca 

ctagcagaagaagaggtagtaattagatctgccaatttcacagacaAtgctaaa 
gatcgtcttcttctccatcattaatctagacggttaaagtgtctgttacgattt 

accataatagtacagctgaaccaatctgtagaaattaattgtacaagacccaac 

TGGTATTATCATGTCGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 
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Table 8A (cont.) 



AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGAGATCCATCGCAGCGTAATGCTCTACACCACGCCGAACACCTGG 
TTACTCAGGCTCTAGGTAGCGTCGCATTACGAGATGTGGTGCGGCTTGTGGACC 

GTGGACGATATCACCGTGGTGACGCATGTCGCGCAAGACTGTAACCACGCGTCT 
CACCTGCTATAGTGGCACCACTGCGTACAGCGCGTTCTGACATTGGTGCGCAGA 

GTTGACTGGCAGGTGGTGGCCAATGGTGATGTCAGCGTTGAACTGCGTGATGCG 
CAACTGACCGTCCACCACCGGTTACCACTACAGTCGCAACTTGACGCACTACGC 

GATCAACAGGTGGTTGCAACTGGACAAGGCACTAGCGGGACTTTGCAAGTGGTG 
CTAGTTGTCCACCAACGTTGACCTGTTCCGTGATCGCCCTGAAACGTTCACCAC 

AATCCGCACCTCTGGCAACCGGGTGAAGGTTATCTCTATGAACTGTGCGTCACA 
TTAGGCGTGGAGACCGTTGGCCCACTTCCAATAGAGATACTTGACACGCAGTGT 

GCCAAAAGCCAGACAGAGTGTGATATCTACCCGCTTCGCGTCGGCATCCGGTCA 
CGGTTTTCGGTCTGTCTCACACTATAGATGGGCGAAGCGCAGCCGTAGGCCAGT 

GTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTTACT 
CACCGTCACTTCCCGCTTGTCAAGGACTAATTGGTGTTTGGCAAGATGAAATGA 

GGCTTTGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTG 
CCGAAACCAGCAGCACTTCTACGCCTGAACGCACCGTTTCCTAAGCTATTGCAC 

CTGATGGTGCACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGT 
GACTACCACGTGCTGGTGCGTAATTACCTGACCTAACCCCGGTTGAGGATGGCA 
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Table 8A (cont, ) 



ACCTCGCATTACCCTTACGCTGAAGAGATGCTCX3ACTGGGCAGAT<JAACATGGC 
TGGAGCGTAATGGGAATGCGACTTCTCTACGAGCTGACCCGTCTACTTGTACCG 

ATCGTGGTGATTGATGAAACTGCTGCTGTCGGCTTTAACCTCTCTTTAGGCATT 
TAGCACCACTAACTACTTTGACGACGACAGCCGAAATTGGAGAGAAATCCGTAA 

GGTTTCGAAGCGGGCAACAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAAC 
CCAAAGCTTCGCCCGTTGTTCGGCTTTCTTGACATGTCGCTTC^CGTCAGTTG 

GGGGAAACTCAGCAAGCGCACTTAGAGGCGATTAAAGAGCTGATA^CQCGTGAC 
CCCCTTTGAGTCGTTCGCGTGAATGTCCGCTAATTTCTCGAOTATCGCGCACTG 

AAAAACCACCCAAGCGTGGTGATGTGGAGTATTGCCAACGAACCGGAT^ 
TTTTTGGTGGGTTCGCACCACTAC^CCTCAT^ 

CCGCAAGGTGCACGGGAATATTTCGCGCCACTGGCGGAAGCAACGCGTAAACTC 
GGCGTTCCACGTGCCCTTATAAAGCGCGGTGACCGCCTTCGTTGCGCATTTGAG 

GACCCGACGCGTCCGATCACCTGCGTCAATGTAATGTTCTGCGACGCTCACACC 
CTGGGCTGCGCAGGCTAGTGGACGCAGTTACATTACAAGACG0«iGAGTqTGG 

GATACCATCAGCGATCTCTTTGATGTGCTGTGCCTGAACCG^TATTACGGATGG 
CTATGGTAGTCGCTAGAGAAACTACACGACACGGACTTGGCAATAATGCCTACC 

TATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAGAACTT 
ATACAGGTTTCGCCGCTAAACCTTTGCCGTCTCTTCCATGACCTTTTTCTTGAA 

CTGGCCTGGCAGGAGAAACTGCATCAGCCGATTATCAT.CACCGAATACGGCGTG 
GACCGGACCGTCCTCTTTGACGTAGTCGGCTAATAGTAGTGGCTOATGCCGCAC 

GATACGTTAGCCGGGCTGCACTCAATGTACACCGACATGTGGAGTGAAGAGTAT 
CTATGCAATCGGCCCGACGTGAGTTACATGTGGCTGTACACCTCACTTCTCATA 

CAGTGTGCATGGCTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTC 
GTCACACGTACCGACCTATACATAGTGGCGCAGAAACTAGCGCAGTCGCGGCAG 

GTCGGTGAACAGGTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTG 
CAGCCACTTGTCCATACCTTAAAGCGGCTAAAACGCTGGAGdGTTCCGTATAAC 

CGCGTTGGCGGTAACAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCG 
GCGCAACCGCCATTGTTCTTTCCCTAGAAGTGAGCGCTGGCGTTTGGbTTCAGC 

GCGGCTTTTCTGCTGCAAAAACGCTGGACTGGCATG7VACTTCGGTGAAAAACCG 
CGCCGAAAAGACGACGTTTTTGCGACCTGACCGTACTTGAAGCCAeTTTTTGGC 

CAGCAGGGAGGCAAACAA 
GTCGTCCCTCCGTTTGTT 
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Table 9 

Amino acid sequence of fusion protein PB1 

MetLeuArg 

ProValGluThrProThrArgGluIleLysLysLeuAspGlyLeuTrpAlaPhe 

SerLeuAspArgGluArgValAlaAspLeuAsnGlnSerValGluIleAsnCys 

ThrArgProAsnAsnAsnThrArgLysSerlleArglleGlnArgGlyProGly 

ArgAlaPheValThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCysAsn 

IleSerArgAlaLysTrpAsnAsnThrLeuLysGlnlleAspSerLysLeuArg 

GluGlnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGlyAsp 

ProGluIleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsn 

SerThrGlnLeuPheAsnSerThrTrpPheAsnSerThrTrpSerThrLysGly 

SerAsnAsnThrGluGlySerAspThrlleThrLeuProCysArglleLysGln 

IlelleAsnMetTrpGlnGluValGlyLysAlaMetTyrAlaProProIleSer 

GlyGlnlleArgCysSerSerAshlleThrGlyLeuLeuLeuThrArgAspGly 

GlyAsnSerAsnAsnGluSerGluIleArgArgGlnAlaSerArgGluLeuGlu 

PheLeuLysThrLysGlyProArgAspThrProIlePhelleGly 
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Table 9A 

Nucleotide sequence encoding fusion protein PB1 



ATGTTACGTCCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTCGACGGCCTG 
TACAATGCAGGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGAC 

TGGGCATTCAGTCTGGATCGCGAACGCGTGGCCGATCTGAACCAATGTGTAGAA 
ACCCGTAAGTCAGACCTAGCGCTTGCGCACCGGCTAGACTTGGTTAGACATCTT 

ATTAATTGTACAAGACCCAACAACAATACAAGAAAAAGTATCCGTATCCAGAGA 
TAATTAACATGTTCTGGGTTGTTGTTATGTTCTTTTTCATAGGCATAGGTCTCT 

GGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGAAATATGAGACAAGCA 
CCTAATCCCTCTCGTAAACAATGTTATCCTTTTTATCCTTTATACTGTGTTCGT 

CATTGTAACATTAGTAGAGCAAAATGGAATAACACTTTAAAACAGATAOATAGC 
GTAACATTGTAATCATCTCGTTTTACCTTATTGTGAAATTTTGTGTATCTATCG 

AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAGTCCTCA 
TTTAATTCTCTTGTTAAACCTTTATTATTTTGTTATTAGAAAOT 

GGAGGGGACCCAGAAATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTC 
CCTCCCCTGGGTCTTTAACATTGCGTGTCAAAATTAACACCTCCGCTTAAAAAG 

TACTGTAATTCAACAC^CTGTTTAATAGTACTTGGTTTAATAGTACTTGGAGT 
ATGACATTAAGTTGTGTTGACAAATTATCATGAACCAAATTATCATGAACCTCA 

ACTAAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACCCTCCCATGCAGA 
TGATTTCCCAGTTTATTGTGACTTCCTTCACTGTGTTAGTGGGAGGGTACGTCT 

ATAAAACAAATTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCT 
TATTTTGTTTAATATTTGTACACCGTCCTTCATCCTTTTCGTTAGATACGGGGA 

CCCATCAGTGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACA 
GGGTAGTCACCTGTTTAATCTACAAGTAGTT^ATAAtGTCCCGACGATAATTGf 

AGAGATGGTGGTAATAGCAACAATGAGTCCGAGATCCGTCGACAAGCTTCCCGG 
TCTCTACCACCATTATCGTTGTTACTCAGGCTCTAGGCAGCTGTTCGAAGGGCC 

GAGCTCGAATTCTTGAAGACGAAAGGGCCTCGTGATACTCCTATTTOTATAGGT 
CTCGAGCTTAAGAACTTCTGCTTTCCCGGAGCACTATGCGGATAAAMTATCCA 
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Table 10 

Amino acid sequence of fusion protein 590 

MetLeuArgProValGluThr 
ProThrArgGluIleLysLysLeuAspGlyLeuTrpAlaPheSerLeuAspArg 
GluArgValAlaAspLeuAsnGlnSerValGluIleAsnCysThrXrgProAsn • 
AsnAsnThrArgLysSerlleArglleGlnArgGlyProGlyArgAlaPheVal 
ThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCysAsnlleSerArgAla 
LysTrpAsnAsnThrLeuLysGlnlieAspSerLysLeuArgGluGlnPheGly 
AsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGluIleVal 
ThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeu 
PheAsnSerThrTrpPheAsnSerThrTrpSerThrLysGlySerAsnAsnThr 
GluGlySerAspThrlleThrLeuProCysArglleLysGlnllelleAsnMet 
TrpGlnGluValGlyLysAlaMetTyrAlaProProIleSerGlyGlnlleArg 
CysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnSerAsn 
AsnGluSerGluIlePheArgProGlyGlyGlyAspMetArgAspAsnTrpArg 
SerGluLeuTyrLysTyrLysValValLysIleGluProLeuGlyValAlaPro 
ThrLysAlaLysArgArgValValGlnArgGluLysArgAlaValGlylleGly 
AlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMetGlyAlaAlaSer 
MetThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGIn 
AsnAsnLeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrVal 
TrpGlylleLysGlnLeuGlnAlaArglleLeuAlaValGluArgTyrLeuLys 
AspGlnGlnLeuLeuGlylleTrpGlyCysSerGlyLysLeuIleCysThrThr 
AlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrpAsn 
AsnMetThrTrpMetGluTrpAspArgGluIleAsnAsnTyrThrSerPhePro 
IleHisArgSerValMetLeuTyrThrThrProAsnThrTrpValAspAspIle 
ThrValValThrHisValAlaGlnAspCysAsnHisAlaSerValAspTrpGln 
ValValAlaAsnGlyAspValSerValGluLeuArgAspAlaAspGlnGlnVal 



Table 10 (cont.) 



ValAlaThrGlyGlnGlyThrSerGlyThrLeuGlnValValAeriProBisLeu 

TrpGlnProGlyGluGlyTyrLeuTyrGluLeuCysValThrAlaLysSerGln 

ThrGluCysAspIleTyrProLeuArgValGlylleArgSerValAlaValLys 

GlyGluGlnPheLeuIleAsnHisLysProPheTyrPheThrGlyPheGlyArg 

HieGluAspAlaAspLeuArgGlyLyBGlyPheAspABriValLeuMetValHis 

AspHiBAlaLeuMetAspTrpIleGlyAlaAsnSferTy rArgThrSe rHisTyr 

ProTyrAlaGluGluMetLeuAspTrpAlaAspGluHisGlylleValVallle 

AspGluThrAlciAlaValGlyPheAsnLeuSerLeuGlylieCSlyPheGluAla 

GlyAsnLysProLysGluLeuTyrSerGluGluAlaValAsnGlyGluthrGIn 

GlnAlaHisLeuGlnAlalleLysGluLeuIleAlaArgAspLysAsnHisPro 

SerValValMetTrpSerlleAlaAsnGluProAspThrArgProGlnGlyAla 

ArgGluTyrPheAlaProLeuAlaGluAlaThrArgLysLeuAspProThrArg 

ProIleThrCysValAsnValMetPheCysABpAlaHisThrAspThrlleSer 

AspLeuPheAspValLeuCysLeuAsnArgTyrTyrGlyTrpTyrValGlnSer 

GlyAspLeuGluThrAlaGluLysValLeuGluLysGluLeuLeuAlaTrpGln 

GluLysLeuHisGlnProIlellelleThrGluTyrGlyValAspThrLeuAla 

GlyLeuHisSerMetTyrThrAspMetTrpSerGluGluTyrGlntysAlaTrp 

LeuAspMetTyrHisArgValPheAspArgValSerAlaValValGlyGluGln 

ValTrpAsnPheAlaAspPheAlaThrSerGlnGlylleLeuArgValGlyGly 

AsnLysLysGlyllePheThrArgAspArgLysProLyBSerAlaAlaPheLeu 

LeuGlnLysArgTrpThrGlyMetAsnPheGlyGluLyeProGlnGlnGlyGly 

LysGln 
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Table 10A 

Nucleotide sequence encoding fusion protein 590 



ATGTTACGTCCTGTAGAAACC 
TACAATGCAGGACATCTTTGG 

CCAACCCGTGAAATCAAAAAACTCGACGGCCTGTGGGCATTCAGTCTGGATCGC 
GGTTGGGCACTTTAGTTTTTTGAGATGCCGGACACCCGTAAGTCAGACCTAGCG 

GAACGCGTGGCCGATCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
CTTGCGCACCGGCTAGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AAT7VATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGA 
TTACTCAGGCTCTAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACCTCT 

AGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCC 
TCACTTAATATATTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGG 

ACCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGA 
TGGTTCCGTTTCTCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCT 
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Table 10A (cont.) 



gctttgttccttgggttcttgggagcagcaggaagcactatgggcgcagcgtca 
cgaaacaaggaacccaagaaccctcgtcgtccttcgtgatacccgcgtcgcagt 

atgacgctgacggtacaggccagacaattattgtctggtasaotgcagcagcag 
tactgcgactgccatgtccggtctgttaataacagaccatAtcacgtcgtcgtc 

aacaatttgctgagggctattgaggcgcaacagcatctgttgcaactcacagtc 
ttgttaaacgactcccgataactccgcgttgtcgtagacaacgttgagtgtcag 

tggggcatcaagcagctccaggcaagaatcctggctgtggaaagatacctaaag 
accccgtagttcgtcgaggtccgttcttaggaccgacacctttctatggatttc 

gatcaacagctcctggggatttggggttgctctggaaaactcatttgcaccact 
ctagttgtcgaggacccctaaaccccaacgagaccttttgagtaaacgtggtga 

gctgtgccttggaatgctagttggagtaataaatctctggaacagatttggaat 
cgacacggaaccttacgatcaacctcattatttagagaccttgtctaaacctta 

aacatgacctggatggagtgggacagagaaattaacaattacacaagcttcccg 
ttgtactggacctacctcaccctgtctctttaattgttaatgtgttcgaagggc 

atccatcgcagcgtaatgctctacaccacgccgaacacct6ggtggacgatatc 
taggtagcgtcgcattacgagatgtggtgcggcttgtggacccacctgctatag 

accgtggtgacgcatgtcgcgcaagactgtaaccacgcgtctgttgactggcag 
tggcaccactgcgtacagcgcgttctgacattggtgcgca^acaactgaccgtc 

gtggtggccaatggtgatgtcagcgttgaactgcgtgatgcggatcaacaggtg 
caccaccggttaccactacagtcgcaacttgacgcactacgcctagttgtccac 

gttgcaactggacaaggcactagcgggactttgcaagtggtgaatccgcacctc 
caacgttgacctgttccgtgatcgccctgaaacgttcaccacttaggcgtggag 

tggcaaccgggtgaaggttatctctatgaactgtgcgtcacagccaaaagccag 
accgttggcccacttccaatagagatacttgacacgcagtgtcggttttcggtc 

acagagtgtgatatctacccgcttcgcgtcggcatccggtcagtggcagtgaag 
tgtctcacactatagatgggcgaagcgcagccgtaggccagtcagcgtcacttc 

ggcgaacagttcctgattaaccacaaaccgttctactttactggctttggtcgt 
ccgcttgtcaaggactaattggtgtttggcaagatgaaatgaccgaaaccagca 

CATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTGCTGATGGTGCAC 
GCACTTCTACGCCTGAACGCACCGTTTCCTAAGCTATTGCACGACTACCACGTG 

GACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGTACCTCGCATTAC 
CTGGTGCGTAATTACCTGACCTAACCCCGGTTGAGGATGGCATGGAGCGTAATG 
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Table 10A (cont.) 



CCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCATCGTGGTGATT 
GGAATGCGACTTCTCTACGAGCTGACCCGTCTACTTGTACCGTAGCACCACTAA 

GATGAAACTGCTGCTGTCGGCTTTAACCTCTCTTTAGGCATTGGTTTCGAAGCG 
CTACTTTGACGACGACAGCCGAAATTGGAGAGAAATCCGTAACCAAAGCTTCGC 

GGCAACAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAG 
CCGTTGTTCGGCTTTCTTGACATGTCGCTTCTCCGTCAGTTGCCCCTTTGAGTC 

CAAGCGCACTTACAGGCGATTAAAGAGCTGATAGCGCGTGACAAAAACCACCCA 
GTTCGCGTGAATGTCCGCTAATTTCTCGACTATCGCGCACTGTTTTTGGTGGGT 

AGCGTGGTGATGTGGAGTATTGCCAACGAACCGGATACCCGTCCGCAAGGTGCA 
TCGCACCACTACACCTCATAACGGTTGCTTGGCCTATGGGCAGGCGTTCCACGT 

CGGGAATATTTCGCGCCACTGGCGGAAGCAACGCGTAAACTCGACCCGACGCGT 
GCCCTTATAAAGCGCGGTGACCGCCTTCGTTGCGCATTTGAGCTGGGCTGCGCA 

CCGATCACCTGCGTCAATGTAATGTTCTGCGACGCTCACACCGATACCATCAGC 
GGCTAGTGGACGCAGTTACATTACAAGACGCTGCGAGTGTGGCTATGGTAGTCG 

GATCTCTTTGATGTGCTGTGCCTGAACCGTTATTACGGATGGTATGTCCAAAGC 
CTAGAGAAACTACACGACACGGACTTGGCAATAATGCCTACCATACAGGTTTCG 

GGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAGAACTTCTGGCCTGGCAG 
CCGCTAAACCTTTGCCGTCTCTTCCATGACCTTTTTCTTGAAGACCGGACCGTC 

GAGAAACTGCATCAGCCGATTATCATCACCGAATACGGCGTGGATACGTTAGCC 
CTCTTTGACGTAGTCGGCTAATAGTAGTGGCTTATGCCGCACCTATGCAATCGG 

GGGCTGCACTCAATGTACACCGACATGTGGAGTGAAGAGTATCAGTGTGCATGG 
CCCGACGTGAGTTACATGTGGCTGTACACCTCACTTCTCATAGTCACACGTACC 

CTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTCGTCGGTGAACAG 
GACCTATACATAGTGGCGCAGAAACTAGCGCAGTCGCGGCAGCAGCCACTTGTC 

GTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCGTTGGCGGT 
CATACCTTAAAGCGGCTAAAACGCTGGAGCGTTCCGTATAACGCGCAACCGCCA 

AACAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTTTTCTG 
TTGTTCTTTCCCTAGAAGTGAGCGCTGGCGTTTGGCTTCAGCCGCCGAAAAGAC 

CTGCAAAAACGCTGGACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGC 
GACGTTTTTGCGACCTGACCGTACTTGAAGCCACTTTTTGGCGTCGTCCCTCCG 

AAACAA 
TTTGTT 



Table 11 

Amino acid sequence of fusion protein KH1 

. MetLeuArg 

ProValGluThrProThrArgGluIleLysLyaLeuAspGlyLeuTrpAlaPhe 
SerLeuAspArgGluArgGluPheProValTrpLysGluAlaThrThrThrLeu 
PheCyBAraSerAspAlaLysAlaTyrAspThrGluValHi^AanValTrpAla 
ThrHisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsn 
ValThrGluAsnPheAsnMetTrpLysAsnAspMetValGluGlnMetHisGlu 
AspIlelleSerLeuTrpAspGlnSerLeuLysProCyBVaXLyeLeuThrPro 
LeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrABnThrAsnSer 
SerSerGlyArgMetlleMetGluLysGlyGluIleLysAsnCysSerPheAsn 
IleSerThrSerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLys 

i 

LeuAspIlelleProIleAspAsnAspThrThrSerTyrThrLeuThrSerCys 
AsnThrSerVallleThrGlnAlaCysProLysValSerPbeGluProIlePro 
IleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsnAsnLysThr 
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCyBThrHisGly 
IleArgProValValSerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGlu 
GluValVallleArgSerAlaAsnPheThrAspAsnAlaLysThrllelleVal 
GlnLeuAsnGlnSerValGlulleAsnCysThrArgProAsnAsnAsnThrArg 
LysSerlleArglleGlnArgGlyProGlyArgAlaPheValThrlleGlyLys 

IleGlyAsnMetArgGlnAlaHisCysAsnlleSerArgAlaLyBTrpAsnAsn 

i. 

ThrLeuLysGlnlleAspSerLysLeuArgGluGlnPheGlyAsnAsnLysThr 
IlellePheLysGlnSerSerGlyGlyAspProGluIleValThrHisSerPhe 
AsnCysGlyGlyGluPhePheTy rCy sAsnSe rThrGlnLeuPheAanSe rTh r 
TrpPheAsnSerThrTrpSerThrLysGlySerAsnAsnThrGluGlySerAsp 
ThrlleThrLeuProCyBArglleLysGlnllelleAsnMetTrpGlnGluVal 
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Table 11 (cont.) 

GlyLysAlaMetTyrAlaProProIleSerGlyGlnlleArgCysSerSerAsn 

IleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnSerAsnAsnGluSerGlu 

IlePheArgProGlyGlyGlyAspMetArgAspAsnTrpArgSerGluLeuTyr 

LysTyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLys 

ArgArgValValGlnArgGluLysArgAlaValGlylleGlyAlaLeuPheLeu 

GlyPheLeuGlyAlaAlaGlySerThrMetGlyAlaAlaSerMetThrLeuThr 

ValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsnLeuLeu 

ArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLys 

GlnLeuGlnAlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeu 

LeuGlylleTrpGlyCysSerGlyLysLeuIleCysThrThrAlaValProTrp 

AsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrpAsnAsnMetThrTrp 

MetGluTrpAspArgGluIleAsnAsnTyrThrSerPheProGlyAlaArglle 

LeuGluAspGluArgAlaSer 
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Table HA 

Nucleotide sequence encoding fuBion protein KH1 



: ATGTTACGT 
/ TACAATGCA 

CCTGTAGAAACCCCAACCCGTGAAATCAAAAAACtfCGACGGCCtGTGGGCATTC 
GGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGACACCCGTAAG 

AGTCTGGATCGCGAACGCGAATTCCCTGTGTGGAAGGAAGCAACCACCACTCTA 
TCAGACCTAGCGCTTGCGCTTAAGGGACACACCTTCCTTCGTTGGTGGTGAGAT 

TTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCC 
AAAACACGTAGTCTACGATTTCGTATACTATGTCTCCATGTATTACAAACCCGG 

ACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAOtt»GTATTGGTAAAT 
TGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCATCATAACCATTTA 

GTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAACAGATGCATGAG 

cactgtcttttaaaattgtacacctttttactgtaccatcttgrctacgtactgi 

gatataatcagtttatgggatcaaagcctaaagccatgtgtaaaattaacccca 
ctatattagtcaaataccctagtttcggatttcggtacacattttaattggggt 

ctctgtgttagtttaaagtgcactgatttgaagaatgatactaataccaatagt 
gagacacaatcaaatttcacgtgactaaacttcttactatgattatggttatca 

agtagcgggagaatgataatggagaaaggagagataaaaaactgctctttcaat 
tcatcgccctcttactattacctctttcctctctattttttgacgagaaagtta. 

atcagcacaagcataagaggtaaggtgcagaaagaatatgcatttttttataaa 
tagtcgtgttcgtattctccattccacgtctttcttatacgtaaaaaaatattt 

cttgatataataccaatagataatgatactaccagctataccttgagaagttgt 
gaactatattatggttatctattactatgatggtcgatatgcaactgttcaaca 

aacacctcagtcattacacaggcctgtccaaaggtatcctttgagccaattccc 
ttgtggagtcagtaatgtgtccggacaggtttccataggaaactcggttaaggg 

atacattattgtgccccggctggttttgcgattctaaaatgtaataataagacg 

TATGTAATAACACGGGGCCGACCAAAACGCTAAGATTTTACATTATTATTCTGC 

TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGA 
AAGTTACCTTGTCCTGGTACATGTTTACAGTCGTGTCATGTTACATGTGTACCT 

ATTAGGCCAGTAGTATCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAA 
TAATCCGGTCATCATAGTTGAGTTGACGACAATTTACCGTCAGATCGTCTTCTT: 

GAGGTAGTAATTAGATCTGCCAATTTCACAGACAATGCTAAAACCATAATAGTA 
CTCCATCATTAATCTAGACGGTTAAAGTGTCTGTTACGATTTTGGTATTATCAT: 
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Table 11A (cont.) 



CAGCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAACAACAATACAAGA 
GTCGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTGTTGTTATGTTCT 

AAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTTACAATAGGAAAA 
TTTTCATAGGCATAGGTGTCTCCTGGTCCCTCTCGTAAACAATGTTATCCTTTT 

ATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATAAC 
TATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGTTTTACCTTATTG 

ACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGAAATAATAAAACA 
TGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCTTTATTATTTTGT 

ATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTAACGCACAGTTTT 
TATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACATTGCGTGTCAAAA 

AATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGTACT 
TTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGACAAATTATCATGA 

TGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACTGAAGGAAGTGAC 
ACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGACTTCCTTCACTG 

ACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATGTGGCAGGAAGTA 
TGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTACACCGTCCTTCAT 

GGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGATGTTCATCAAAT 
CCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCTACAAGTAGTTTA 

ATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAACAATGAGTCCGAG 
TAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTGTTACTCAGGCTC 

ATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATAT 
TAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACCTCTTCACTTAATATA 

AAATATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAG 
TTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGGTGGTTCCGTTTC 

AGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTT 
TCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCTCGAAACAAGGAA 

GGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCAATGACGCTGACG 
CCCAAGAACCCTCGTCGTCCTTCGTGATACCCGCGTCGCAGTTACTGCGACTGC 

GTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAATTTGCTG 
CATGTCCGGTCTGTTAATAACAGACCATATCACGTCGTCGTCTTGTTAAACGAC 

AGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAG 
TCCCGATAACTCCGCGTTGTCGTAGACAACGTTGAGTGTCAGACCCCGTAGTTC 

CAGCTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTC 
GTCGAGGTCCGTTCTTAGGACCGACACCTTTCTATGGATTTCCTAGTTGTCGAG 
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Table 11A (cont.) 



CTGGGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGG 
GACCCCTAAACCCCAACGAGACCTTTTGAGTAAACGTGGTGACGACACGGAACC 

AATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATAACATGACCTGG 
TTACGATCAACCTCATTATTTAGAGACCTTGTCTAAACCTTATTG7?ACTGGACC 

ATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTCCCGGGAGCTCGAATT 
TACCTCACCCTGTCTCTTTAATTGTTAATGTGTTCGAAGGGCCCTCGAGCTTAA 

CTTGAAGACGAAAGGGCCTCG 
G AACTTCTGCTTTCC CGGAGC 
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CLAIMS 



1. A recombinant DNA transfer vector comprising 
DNA having all or part of the following nucleotide 
sequence or equivalent nucleotide sequences containing 
bases whose translated region codes for HTLV-III 
envelope protein fragment denoted RIO: 

ATGTTACGT 
TACAATGCA 

CCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTCGACGGCCTGTGGGCATTC 
GGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGACACCCGTAAG 

AGTCTGGATCGCGAAAACTGTGGAATTGATCAATTCCCTGTGTGGAAGGAAGCA 
TCAGACCTAGCGCTTTTGACACCTTAACTAGTTAAGGGACACACCTTCCTTCGT 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACAT 
TGGTGGTGAGATAAAACACGTAGTCTACGATTTCGTATACTATGTCTCCATGTA 

AATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTA 
TTACAAACCCGGTGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCAT 

GTATTGGTAAATGTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAA 
CATAACCATTTACACTGTCTTTTAAAATTGTACACCTTTTTACTGTACCATCTT 

CAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTA 
GTCTACGTACTCCTATATTAGTCAAATACCCTAGTTTCGGATTTCGGTACACAT 

AAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACT 
TTTAATTGGGGTGAGACACAATCAAATTTCACGTGACTAAACTTCTTACTATGA 

AATACCAATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAAC 
TTATGGTTATCATCATCGCCCTCTTACTATTACCTCTTTCCTCTCTATTTTTTG 

TGCTCTTTCAATATCAGCACAAGCATAAGAGGTAAGGTGCAGAAAGAATATGCA 
ACGAGAAAGTTATAGTCGTGTTCGTATTCTCCATTCCACGTCTTTCTTATACGT 

TTTTTTTATAAACTTGATATAATACCAATAGATAATGATACTACCAGCTATACG 
AAAAAAATATTTGAACTATATTATGGTTATCTATTACTATGATGGTCGATATGC 

TTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCTTT 
AACTGTTCAACATTGTGGAGTCAGTAATGTGTCCGGACAGGTTTCCATAGGAAA. 

GAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGT 
CTCGGTTAAGGGTATGTAATAACACGGGGCCGACCAAAACGCTAAGATTTTACA 

AATAATAAGACGTTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTAGAA 
TTATTATTCTGCAAGTTACCTTGTCCTGGTACATGTTTACAGTCGTGTCATGTT; 
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3 U TGTACACATGGAATTAGGCCAGTAGTATCAACTCAACTGCTGTTAAATGGC AGT 

35 ACATGTGTACCTTAATCCGGTCATCATAGTTGAGTTGACGACAATTTACCGTCA 

36 CTAGCAGAAGAAGAGGTAGTAATTAGATCTGCCAATTTCACAGACAATGCTAAA 

37 GATCGTCTTCTTCTCCATCATTAATCTAGACGGTTAAAGTGTCTGJTACGATTT 

38 ACCATAATAGTACAGCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
3 9 TGGTATTATCATGTCGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

40 AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 

A 1 TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCTCGTAAACAA 

A 2 ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGT AACATTAGTAGAGC A 

A3 TGTTATCCTTTTTATCCTTTATACTGTGTTCGTGTAACATTGTAATCATCTCGT 

A A AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 

A 5 TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGOTAAACCT 

A 6 AATAATAAAACAATAATCTTTAAGC AGTCCTCAGGAGGGGACCC AGAAATTGTA 

A7 TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTC^TTAACAT 

A8 ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 

A 9 TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

50 TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 

5 1 AAATTATC^TGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

52 GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 

53 CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

54 TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 

55 ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

56 TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 

5 7 ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

58 AATGAGTCCGAGATCCATCGCAGCGTAATGCTCTACACCACGCCGAACACCTGG 

59 TTACTCAGGCTCTAGGTAGCGTCGCATTACGAGATGTGGTGCGGCTTGTGGACC 

60 GTGGACGATATCACCGTGGTGACGCATGTCGCGCAAGACTGTAACCACGCGTCT 

61 CACCTGCTATAGTGGCACCACTGCGTACAGCGCGTTCTGACATTGGTGCGCAGA 

62 GTTGACTGGCAGGTGGTGGCCAATGGTGATGTCAGCGTTGAACTGCGTGATGCG 

63 CAACTGACCGTCCACCACCGGTTACCACTACAGTCGCAACTTGACGCACTACGC 

6 A GATCAACAGGTGGTTGCAACTGGACAAGGCACTAGCGGGACTTTGCAAGTGGTG 

65 CTAGTTGTCCACCAACGTTGACCTGTTCCGTGATCGCCCTGAAAGG^^XCCAC 

66 AATCCGCACCTCTGGCAACCGGGTGAAGGTTATCTCTATGAACTGTGCGTCACA 

67 TTAGGCGTGGAGACCGTTGGCCCACTTCCAATAGAGATACTTGACAeGCAGTGT 

68 GCCAAAAGCCAGACAGAGTGTGATATCTACCCGCTTCGCGTCGGCATCCGGTCA 
6 f CGGTTTTCGGTCTGTCTCACACTATAGATGGGCGAAGCGCAGCCGTAGGCGAGT 
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70 GTGGCAGTGAAGGGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTTACT 

71 CACCGTCACTTCCCGCTTGTCAAGGACTAATTGGTGTTTGGCAAGATGAAATGA 

72 GGCTTTGGTCGTCATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTG 

73 CCGAAACCAGCAGCACTTCTACGCCTGAACGCACCGTTTCCTAAGCTATTGCAC 

74 CTGATGGTGCACGACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGT 
7 5 GACTACCACGTGCTGGTGCGTAATTACCTGACCTAACCCCGGTTGAGGATGGCA 

76 ACCTCGCATTACCCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGC 

77 TGGAGCGTAATGGGAATGCGACTTCTCTACGAGCTGACCCGTCTACTTGTACCG 

78 ATCGTGGTGATTGATGAAACTGCTGCTGTCGGCTTTAACCTCTCTTTAGGCATT 

79 TAGCACCACTAACTACTTTGACGACGACAGCCGAAATTGGAGAGAAATCCGTAA 

80 GGTTTCGAAGCGGGCAACAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAAC 

81 CCAAAGCTTCGCCCGTTGTTCGGCTTTCTTGACATGTCGCTTCTCCGTCAGTTG 

82 GGGGAAACTCAGCAAGCGCACTTACAGGCGATTAAAGAGCTGATAGCGCGTGAC 

83 CCCCTTTGAGTCGTTCGCGTGAATGTCCGCTAATTTCTCGACTATCGCGCACTG 

84 AAAAACCACCCAAGCGTGGTGATGTGGAGTATTGCCAACGAACCGGATACCCGT 

85 TTTTTGGTGGGTTCGCACCACTACACCTCATAACGGTTGCTTGGCCTATGGGCA 

86 CCGCAAGGTGCACGGGAATATTTCGCGCCACTGGCGGAAGCAACGCGTAAACTC 

87 GGCGTTCCACGTGCCCTTATAAAGCGCGGTGACCGCCTTCGTTGCGCATTTGAG 

88 GACCCGACGCGTCCGATCACCTGCGTCAATGTAATGTTCTGCGACGCTCACACC 

8 9 CTGGGCTGCGCAGGCT AGTGG ACGCAGTTACATTACAAG ACGCTGCGAGTGTGG 

90 GATACCATCAGCGATCTCTTTGATGTGCTGTGCCTGAACCGTTATTACGGATGG 

91 CTATGGTAGTCGCTAGAGAAACTACACGACACGGACTTGGCAATAATGCCTACC 

92 TATGTCCAAAGCGGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAGAACTT 

93 ATACAGGTTTCGCCGCTAAACCTTTGCCGTCTCTTCCATGACCTTTTTCTTGAA 

94 CTGGCCTGGCAGGAGAAACTGCATCAGCCGATTATCATCACCGAATACGGCGTG 

95 GACCGGACCGTCCTCTTTGACGTAGTCGGCTAATAGTAGTGGCTTATGCCGCAC 

96 GATACGTTAGCCGGGCTGCACTCAATGTACACCGACATGTGGAGTGAAGAGTAT 

97 CTATGCAATCGGCCCGACGTGAGTTACATGTGGCTGTACACCTCACTTCTCATA 

98 CAGTGTGCATGGCTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTC 

9 9 GTCACACGTACCGACCT ATACATAGTGGCGCAGAAACTAGCGCAGTCGCGGCAG 

100 GTCGGTGAACAGGTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGC ATATTG 

101 CAGCCACTTGTCCATACCTTAAAGCGGCTAAAACGCTGGAGCGTTCCGTATAAC 

102 CGCGTTGGCGGTAACAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCG 

103 GCGCAACCGCCATTGTTCTTTCCCTAGAAGTGAGCGCTGGCGTTTGGCTTCAGC 

104 GCGGCTTTTCTGCTGCAAAAACGCTGGACTGGCATGAACTTCGGTGAAAAACCG 

105 CGCCGAAAAGACGACGTTTTTGCGACCTGACCGTACTTGAAGCCACTTTTTGGC 

106 CAGC AGGGAGGCAAACAA 

107 GTCGTCCCTCCGTTTGTT. 
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2. A recombinant DNA transfer vector comprising 
DNA having all or part of the following nucleotide 
sequence or equivalent nucleotide sequences containing 
bases whose translated region codes for HTLV-III 
envelope protein fragment denoted PB1: 



6 ATGTTACGTCCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTCGACGGCCTG 

7 TACAATGCAGGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGAC 

8 TGGGCATTCAGTCTGGATCGCGAACGCGTGGCCGATCTGAACCAATCTGTAGAA 

9 ACCCGTAAGTCAGACCTAGCGCTTGCGCACCGGCTAGACTTGGTTAGACATCTT 

10 ATTAATTGTACAAGACCCAACAACAATACAAGAAAAAGTATCCGtATCCAGAGA 

1 1 TAATTAACATGTTCTGGGTTGTTGTTATGTTCTTTTTCATAGGCATAGGTCTCT 

12 GGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGAAATATGAGACAAGCA 

13 CCTAATCCCTCTCGTAAACAATGTTATCCTTTTTATCCTTTATACTCTGTTCGT 

1 4 CATTGTAACATTAGTAGAGCAAAATGGAATAACACTTTAAAACAGATAGATAGC 

15 GTAACATTGTAATCATCTCGTTTTACCTTATTGTGAAATTTTGTCTATCTATCG 

1 6 AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAGTCCTCA 

17 TTTAATTCTCTTGTTAAACCTTTATTATTTTGTTATTAGAAATTCGTCAGGAGT 

1 8 GGAGGGGACCCAGAAATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTG 

1 9 CCTCCCCTGGGTCTTTAACATTGCGTGTCAAAATTAACACCTCCCCTTAAAAAG 

2 0 T ACTGT AATTCAAC ACAACTGTTT AATAGTACTTGGTTTAATAGiACTTGG AGT 

21 ATGACATTAAGTTGTGTTGACAAATTATCATGAACCAAATTATCATGAACCTCA 

22 ACTAAAGGGTCAAATAACACTGAAGGAAGTGACACAATCAGCCTCCCATGCAGA 

23 TGATTTCCCAGTTTATTGTGACTTCCTTCACTGTGTTAGTGGGAiSGGTACGTCT 

2 A ATAAAACAAATTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCT 

2 5 TATTTTGTTTAATATTTGTACACCGTCCTTCATCCTTTTCGTTACATACGGGGA 

26 CGCATCAGTGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACA 

2 7 GGGTAGTCACCTGTTTAATCTACAAGTAGTTTATAATGTCCCGACGAJPAATTGT 

28 AGAGATGGTGGTAATAGCAACAATGAGTCCGAGATCCGTCGACAAGCTTCCCGG 

29 TCTCTACCACCATTATCGTTGTTACTCAGGCTCTAGGCAGCTGTTCGAAGGGGC 

30 GAGCTCGAATTCTTGAAGACGAAAGGGCCTCGTGATACTCCTATTTTTATAGGT 

3 1 CTCG AGCTT AAG AACTTCTGCTTTCCCGG AGCACTATGCGGATAAAAATATCCA f 
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3. A recombinant DNA transfer vector comprising 
DNA having all or part of the following nucleotide 
sequence or equivalent nucleotide sequences containing 
bases whose translated region codes for HTLV-III 
envelope protein fragment denoted 590: 

ATGTTACGTCCTGTAGAAACC 
TACAATGCAGGACATCTTTGG 

CCAACCCGTGAAATCAAAAAACTCGACGGCCTGTGGGCATTCAGTCTGGATCGC 
GGTTGGGCACTTTAGTTTTTTGAGATGCCGGACACCCGTAAGTCAGACCTAGCG 

GAACGCGTGGCCGATCTGAACCAATCTGTAGAAATTAATTGTACAAGACCCAAC 
CTTGCGCACCGGCTAGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTG 

AACAATACAAGAAAAAGTATCCGTATCCAGAGAGGACCAGGGAGAGCATTTGTT 
TTGTTATGTTCTTTTTCATAGGCATAGGTCTCTCCTAATCCCTCTCGTAAACAA 

ACAATAGGAAAAATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCA 
TGTTATCCTTTTTATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGT 

AAATGGAATAACACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGA 
TTTACCTTATTGTGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCT 

AATAATAAAACAATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTA 
TTATTATTTTGTTATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACAT 

ACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTG 
TGCGTGTCAAAATTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGAC 

TTTAATAGTACTTGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACT 
AAATTATCATGAACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGA 

GAAGGAAGTGACACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG 
CTTCCTTCACTGTGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTAC 

TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGA 
ACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAGTCACCTGTTTAATCT 

TGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAAC 
ACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTG 

AATGAGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGA 
TTACTCAGGCTCTAGAAGTCTGGACCTCCTCCTCTATACTCGCTGTTAACCTCT 

AGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCC 
TCACTTAATATATTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGG 

ACCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGA 
TGGTTCCGTTTCTCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCT 
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GCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCA 
CGAAACAAGGAACCCAAGAACCCTCGTCGTCCTTCGTGATACCCGCGTCGCAGT 

ATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTASAGTGCAGCAGCAGi 
TACTGCGACTGCCATGTCCGGTCTGTTAATAACAGACCATATCACGTCGTCGTC 

AACAATTTGCTGAGGGCTATTGAGGCGCAAC AGCATCTGTtGCAACTC ACAGTC 
TTGTTAAACGACTCCCGATAACTCCGCGTTGTCGTAGACAACGTTGAGTGTCAG 

TGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAG 
ACCCCGTAGTTCGTCGAGGTCCGTTCTTAGGACCGACACCTTTCTATGGATTTC 

GATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACT 
CTAGTTGTCGAGGACCCCTAAACCCCAACGAGACCTTTTGAGTAAACGTGGTGA 

GCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAAT 
CGACACGGAACCTTACGATCAACCTCATTATTTAG AGACCTTGTCTAAACCTTA 

AACATGACCTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTCCCG 
TTGTACTGGACCTACCTCACCCTGTCTCTTTAATTGTTAATGTGTTCGAAGGGC ? 

ATCCATCGCAGCGTAATGCTCTACACCACGCCGAACACCTGGGTGGACGATATC : ' 
TAGGTAGCGTCGCATTACGAGATGTGGTGCGGCTTGTGGACCCACCTGCTATAG 

ACCGTGGTGACGCATGTCGCGCAAGACTGTAACCACGCGTCTGTTGACTGGCAG 
TGGCACCACTGCGTACAGCGCGTTCTGACATTGGTGCGCAGACAACTGACCGTC: 

GTGGTGGCCAATGGTGATGTCAGCGTTGAACTGCGTGATGCGGATCAACAGGTG 
CACCACCGGTTACCACTACAGTCGCAACTTGACGCACTACGCCTAGTTGTCCAC 

GTTGCAACTGGACAAGGCACTAGCGGGACTTTGCAAGTGGTGAATCCGCACCTC 
CAACGTTGACCTGTTCCGTGATCGCCCTGAAACGTTCACCACTTAGGCGTGGAG 

TGGCAACCGGGTGAAGGTTATCTCTATGAACTGTGCGTCACAGCCAAAAGCCAG 
ACCGTTGGCCCACTTCCAATAGAGATACTTGACACGCAGTGTCGGTTTTCGGTC 

ACAGAGTGTGATATCTACCCGCTTCGCGTCGGCATCCGGTCAGTGGCAGTGAAG 
TGTCTCACACTATAGATGGGCGAAGCGCAGCCGTAGGCCAGTCACCGTCACTTC 

GGCGAACAGTTCCTGATTAACCACAAACCGTTCTACTTTACTGGCTTTGGTCGT 
CCGCTTGTCAAGGACTAATTGGTGTTTGGCAAGATGAAATGACCGAAACCAGCA 

CATGAAGATGCGGACTTGCGTGGCAAAGGATTCGATAACGTGCTGATGGTGCAC 
GCACTTCTACGCCTGAACGCACCGTTTCCTAAGCTATTGCACGACTACCACGTG 

GACCACGCATTAATGGACTGGATTGGGGCCAACTCCTACCGTACCTCGCATTAC 
CTGGTGCGTAATTACCTGACCTAACCCCGGTTGAGGATGGCATGGAGCGTAATG 

CCTTACGCTGAAGAGATGCTCGACTGGGCAGATGAACATGGCATCGTGGTGATT 
GGAATGCGACTTCTCTACGAGCTGACCCGTCTACTTGTAGCGTAGCACCACTAA 

GATGAAACTGCTGCTGTCGGCTTTAACCTCTCTTTAGGCATTGGTTTCGAAGCG 
CTACTTTGACGACGACAGCCGAAATTGGAGAGAAATCCGTAACCAAAGCTTCGC 
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72 GGCAACAAGCCGAAAGAACTGTACAGCGAAGAGGCAGTCAACGGGGAAACTCAG 

73 CCGTTGTTCGGCTTTCTTGACATGTCGCTTCTCCGTCAGTTGCCCCTTTGAGTC 

74 CAAGCGCACTTACAGGCGATTAAAGAGCTGATAGCGCGTGACAAAAACCACCCA 

75 GTTCGCGTGAATGTCCGCTAATTTCTCGACTATCGCGCACTGTTTTTGGTGGGT 

76 AGCGTGGTGATGTGGAGTATTGCCAACGAACCGGATACCCGTCCGCAAGGTGCA 

77 TCGCACCACTACACCTCATAACGGTTGCTTGGCCTATGGGCAGGCGTTCCACGT 

78 CGGGAATATTTCGCGCCACTGGCGGAAGCAACGCGTAAACTCGACCCGACGCGT 

79 GCCCTTATAAAGCGCGGTGACCGCCTTCGTTGCGCATTTGAGCTGGGCTGCGCA 

80 CCGATCACCTGCGTCAATGTAATGTTCTGCGACGCTCACACCGATACCATCAGC 

81 GGCTAGTGGACGCAGTTACATTACAAGACGCTGCGAGTGTGGCTATGGTAGTCG 

8 2 GATCTCTTTGATGTGCTGTGCCTGAACCGTTATTACGGATGGTATGTCCAAAGC 

83 CTAGAGAAACTACACGACACGGACTTGGCAATAATGCCTACCATACAGGTTTCG 

84 GGCGATTTGGAAACGGCAGAGAAGGTACTGGAAAAAGAACTTCTGGCCTGGCAG 

85 CCGCTAAACCTTTGCCGTCTCTTCCATGACCTTTTTjCTTGAAGACCGGACCGTC 

§6 GAGAAACTGCATCAGCCGATTATCATCACCGAATACGGCGTGGATACGTTAGCC 

' CTCTTTGACGTAGTCGGCTAATAGTAGTGGCTTATGCCGCACCTATGCAATCGG 

88 GGGCTGCACTCAATGTACACCGACATGTGGAGTGAAGAGTATCAGTGTGCATGG 

89 CCCGACGTGAGTTACATGTGGCTGTACACCTCACTTCTCATAGTCACACGTACC 

90 CTGGATATGTATCACCGCGTCTTTGATCGCGTCAGCGCCGTCGTCGGTGAACAG 

91 GACCTATACATAGTGGCGCAGAAACTAGCGCAGTCGCGGCAGCAGCCACTTGTC 

92 GTATGGAATTTCGCCGATTTTGCGACCTCGCAAGGCATATTGCGCGTTGGCGGT 

93 CATACCTTAAAGCGGCTAAAACGCTGGAGCGTTCCGTATAACGCGCAACCGCCA 

94 AACAAGAAAGGGATCTTCACTCGCGACCGCAAACCGAAGTCGGCGGCTTTTCTG 

95 TTGTTCTTTCCCTAGAAGTGAGCGCTGGCGTTTGGCTTCAGCCGCCGAAAAGAC 

96 CTGGAAAAACGCTGGACTGGCATGAACTTCGGTGAAAAACCGCAGCAGGGAGGC 

97 GACGTTTTTGCGACCTGACCGTACTTGAAGCCACTTTTTGGCGTCGTCCCTCCG 

98 AAACAA 

99 TTTGTT. 



1 4. A recombinant DNA transfer vector comprising 

2 DNA having all or part of the following nucleotide 

3 sequence or equivalent nucleotide sequences containing 

4 bases whose translated region codes for HTLV-III 

5 envelope protein fragment denoted KHl: 
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6 ATGTTACGT 

7 TACAATGCA 

8 CCTGTAGAAACCCCAACCCGTGAAATCAAAAAACTCGACGGCCTGTGGGCATTC 

9 GGACATCTTTGGGGTTGGGCACTTTAGTTTTTTGAGCTGCCGGACACCCGTAAG 

10 AGTCTGGATCGCGAACGCGAATTCCCTGTGTGGAAGGAAGCAACCACCACTCTA 

1 1 TCAGACCTAGCGCTTGCGCTTAAGGGACACACCTTCCTTCGTTGGTGGTGAGAT 

12 TTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCC 

1 3 AAAACACGTAGTCTACGATTTCGTATACTATGTCTCCATGTATTACAAACCCGG : 

14 ACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAAT 

1 5 TGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTTCATCATAACCATTTA 

16 GTGACAGAAAATTTTAACATGTGGAAAAATGACATGGTAGAACAGATGCATGAG 

17 CACTGTCTTTTAAAATTGTACACCTTTTTACTGTACCATCTTGTCTACGTACTC 

1 8 GATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACCCCA 

1 9 CTATATTAGTCAAATACCCTAGTTTCGGATTTCGGTACACATTTTAATTGGGGT 

20 CTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACCAATAGT 

2 1 GAGACACAATCAAATTTCACGTGACTAAACTTCTTACTATGATTATGGTTATCA 

22 AGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAAGTGCTCTTTCAAT 

23 TCATCGCCCTCTTACTATTACCTCTTTCCTCTCTATTTTTTGACGAGAAAGTTA 

2 4 ATCAGCACAAGCATAAG AGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAA 

25 TAGTCGTGTTCGTATTCTCCATTCCACGTCTTTCTTATACGTAAAAAAATATTT 

2 6 CTTGAT ATAAT ACCAAT AGATAATGAT ACT ACC AGCTATACGTTGACAAGTTGT 
2.7 GAACTATATTATGGTTATCTATTACTATGATGGTCGATATGCAACTGTTCAACA 

28 AACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCTTTGAGCCAATTCCC 

29 TTGTGGAGTCAGTAATGTGTCCGGACAGGTTTCCATAGGAAACTCGGTTAAGGG s 

30 ATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 

31 TATGTAATAACACGGGGCCGACCAAAACGCTAAGATTTTACATTATTATTCTGC 

32 TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGA 

33 AAGTTACCTTGTCCTGGTACATGTTTACAGTCGTGTCATGOTACATGTGTACCT 

3 a attaggccagtagtatcaactcaactgctgttaaatggcagtctA6cagaagaa 
3 5 taatccggtcatcat agttgagttgacgacaatttaccgtcagatcgtcttctt 

56 gaggtagtaattagatctgccaatttcacagacaatgctaaaaccataatagta " 

3° ctccatcattaatctagacggttaaagtgtctgotacgattttggtattatcat 

3 8 C AGCTGAACCAATCTGTAGAAATT AATTGT ACAAG ACCCAACAACAATAC AAGA 
39 GTCGACTTGGTTAGACATCTTTAATTAACATGTTCTGGGTTGTTGTTATGTTCT 

4 0 AAAAGTATCCGTATCC AGAG AGGACC AGGG AG AGCATTTGTTACAATAGGAAAA 
41 TTTTCATAGGC^TAGGTCTCTCCTGGTCCCTCTCGTAAACAATGTTATCCTTTT , 
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A 2 ATAGGAAATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATAAC 

A3 TATCCTTTATACTCTGTTCGTGTAACATTGTAATCATCTCGTTTTACCTTATTG 

A A ACTTTAAAACAGATAGATAGCAAATTAAGAGAACAATTTGGAAATAATAAAACA 

A5 TGAAATTTTGTCTATCTATCGTTTAATTCTCTTGTTAAACCTTTATTATTTTGT 

A6 ATAATCTTTAAGCAGTCCTCAGGAGGGGACCCAGAAATTGTAACGCACAGTTTT 

A7 TATTAGAAATTCGTCAGGAGTCCTCCCCTGGGTCTTTAACATTGCGTGTCAAAA 

A8 AATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGTACT 

49 TTAACACCTCCCCTTAAAAAGATGACATTAAGTTGTGTTGACAAATTATCATGA 

50 TGGTTTAATAGTACTTGGAGTACTAAAGGGTCAAATAACACTGAAGGAAGTGAC 

51 ACCAAATTATCATGAACCTCATGATTTCCCAGTTTATTGTGACTTCCTTCACTG 

5 2 ACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATGTGGCAGGAAGTA 

53 TGTTAGTGGGAGGGTACGTCTTATTTTGTTTAATATTTGTACACCGTCCTTCAT 

5 A GGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGATGTTCATCAAAT 

5 5 CCTTTTCGTTACATACGGGGAGGGT AGTCACCTGTTTAATCTACAAGTAGTTTA 

56 ATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAGCAACAATGAGTCCGAG 

57 TAATGTCCCGACGATAATTGTTCTCTACCACCATTATCGTTGTTACTCAGGCTC 

58 ATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATAT 

59 TAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACCTCTTCACTTAATATA 

6 0 AAATATAAAGTAGTAAAAATTG AACCATTAGGAGTAGCACCCACCAAGGCAAAG 

61 TTTATATTTCATCATTTTTAACTTGGTAATCCTCATCGTGGGTGGTTCCGTTTC 

62 AGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTT 

63 TCTTCTCACCACGTCTCTCTTTTTTCTCGTCACCCTTATCCTCGAAACAAGGAA 

6A GGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCAATGACGCTGACG 

65 CCCAAGAACCCTCGTCGTCCTTCGTGATACCCGCGTCGCAGTTACTGCGACTGC 

66 GTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAATTTGCTG 

67 CATGTCCGGTCTGTTAATAACAGACCATATCACGTCGTCGTCTTGTTAAACGAC 

68 AGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAG 

69 TCCCGATAACTCCGCGTTGTCGTAGACAACGTTGAGTGTCAGACCCCGTAGTTC 

70 CAGCTCCAGGCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTC 

71 GTCGAGGTCCGTTCTTAGGACCGACACCTTTCTATGGATTTCCTAGTTGTCGAG 

72 CTGGGGATTTGGGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGG 

7 3 GACCCCTAAACCCCAACGAGACCTTTTGAGTAAACGTGGTGACGACACGGAACC 

7 A AATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATAACATGACCTGG 

75 TTACGATCAACCTCATTATTTAGAGACCTTGTCTAAACCTTATTGTACTGGACC 

7 g ATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTCCCGGGAGCTCGAATT 

77 TACCTCACCCTGTCTCTTTAATTGTTAATGTGTTCGAAGGGCCCTCGAGCTTAA 

7 g CTTGAAGACGAAAGGGCCTCG 
79 . GAACTTCTGCTTTCCCGGAGC. 



-69- 



5. The DNA transfer vector of any preceding claim 
transferred to and replicated in a eukaryotic or 
prokaryotic host. 

6. A host transformed by the transfer vector of any of 
5 claims 1 to 4. 

7. HTLV-I11 envelope protein fragment denoted RIO 
having the following amino-acid sequence, or mutants 
thereof: 

MetLeuArg 

10 ProValGluThrProThrArgGluIleLysLysLeuAspGlyL.euTrpAlaPhe 
SerLeuAspArgGluAsnCysGlylleAspGlnPheProValTrpLysGluAla - r 
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHis 
AsnValTrpAlaThrHisAlaCysValProThrAspProAsnProGlnGluVal 

1 5 ValLeuValAsnValThrGluAsnPheAsnMetTrpLysAsnAspMetValGlu 



25 



30 
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10 GlnMetHisGluAspIlelleSerLeuTrpAspGlnSerLeuLysProCysVal 

11 LysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThr 

12 AsnThrAsnSerSerSerGlyArgMetlleMetGluLysGlyGluIleLysAsn 

13 CysSerPheAsnlleSerThrSerlleArgGlyLysValGlnLysGluTyrAla 
1 U phePheTyrLysLeuAspIlelleProlleAspAsnAspThrThrSerTyrThr 

15 LeuThrSerCysAsnThrSerVallleThrGlnAlaCysProLysValSerPhe 

16 GluProIleProIleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCys 

17 AsnAsnLysThrPheAsnGlyThrGlyProCysThrAsnValSerThrValGln 

18 CysThrHisGlylleArgProValValSerThrGlnLeuLeuLeuAsnGlySer 

19 LeuAlaGluGluGluValVallleArgSerAlaAsnPheThrAspAsnAlaLys 

20 ThrllelleValGlnLeuAsnGlnSerValGluIleAsnCysThrArgProAsn 

21 AsnAsnThrArgLysSerlleArglleGlnArgGlyProGlyArgAlaPheVal 

22 ThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCysAsnlleSerArgAla 

23 LysTrpAsnAsnThrLeuLysGlnlleAspSerLysLeuArgGluGlnPheGly 

2 U AsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGluIleVal 

25 ThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeu 

26 PheAsnSerThrTrpPheAsnSerThrTrpSerThrLysGlySerAsnAsnThr 

27 GluGlySerAspThrlleThrLeuProCysArglleLysGlnllelleAsnMet 

28 TrpGlnGluValGlyLysAlaMetTyrAlaProProIleSerGlyGlnlleArg 

29 CysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnSerAsn 

30 AsnGluSerGluIleHisArgSerValMetLeuTyrThrThrProAsnThrTrp 

31 valAspAspIleThrValValThrHisValAlaGlnAspCysAsnHisAlaSer 

32 ValAspTrpGlnValValAlaAsnGlyAspValSerValGluLeuArgAspAla 

33 AspGlnGlnValValAlaThrGlyGlnGlyThrSerGlyThrLeuGlnValVal 

3 A AsnProHisLeuTrpGlnProGlyGluGlyTyrLeuTyrGluLeuCysValThr 
35 AlaLysSerGlnThrGluCysAspIleTyrProLeuArgValGlylleArgSer 
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36 ValAlaValLysGlyGluGlnPheLeuIleAsnHiBLysProPheTyrPheThr 

37 GlyPheGlyArgHisGluAspAlaAspLeuArgGlyLyaGlyPheAspAsnVal 

38 LeuMetValHisABpHiBAlaLeuMetAspTrpIleGXyAlaAsnSerTyrArg 

39 ThrSerHisTyrProTyrAlaGluGluMetLeuA^pTrpAlaAspGluHiBGly 
AO IleValValHeAspGluThrAlaAlaValGlyPheAsnLeuSerLeuGlylle 

41 GlyPheGluAlaGlyAsnLysProLysGluLeuTyrSerGluGluAlaValA^n 

42 GlyGluThrGlnGlnAlaHisLeuGlnAlaIleLysGluLeuIleAlaArgA$p 

43 LysAsnHisProSerValValMetTrpSerlleAlaAsnGluProAspThrArg 

44 ProGlnGlyAlaArgGluTyrPheAlaProLeuAlaGluAlaThrArgLysLeu 

45 AspProThrArgProIleThrCysValABnValMetPheCysAapAlaHiBThr 

46 AspThrlleSerAspLeuPheAspValLeuCysLeuAshArgTyrTyrGlyTrp 

47 TyrValGlnSerGlyAspLeuGluThrAlaGluLysValLeuGluLysGluLeu 

48 LeuAlaTrpGlnGluLysLeuHisGlnProIlellelleThrGluTyrGlyVal 

49 AspThrLeuAlaGlyLeuHisSerMetTyrThrAspMetTrpSerGluGluTyr 

50 GlnCysAlaTrpLeuAspMetTyrHisArgValPheAspArgValSerAlaVal 

51 ValGlyGluGlnValTrpAsnPheAlaAspPheAlaThrSerGlnGlylleLeu 

52 ArgValGlyGlyAsnLysLysGlyllePheThrArgAspArgLysProLysSer 

53 AlaAlaPheLeuLeuGlnLysArgTrpThrGlyMetAshPh^GlyGluLysPro 

54 GlnGlnGlyGlyLysGln. 

1 8, HTLV-III envelope protein fragment denoted : 

2 PBl having the following amino-acid sequence, or 

3 mutants thereof: 

4 MetLeuArg 

5 ProValGluThrProThrArgGlulleLysLysLeuAspGlyLeuTrpAlaPhe 

6 SerLeuAspArgGluArgValAlaAspLeuABnGlnSerValGluIleAsnCys 

7 ThrArgProAsnAsnAsnThrArgLysSerlleArglleGli^rgGlyProGXy 
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ArgAlaPheValThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCysAsn 
IleSerArgAlaLysTrpAsnAsnThrLeuLysGlnlleAspSerLysLeuArg 
GluGlnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGlyAsp 
ProGluIleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsn 

Se rThrGlnLeuPheAsnSe rTh rTrpPheAsnSe rThrTrpSe rThrLysGly 
SerAsnAsnThrGluGlySerAspThrlleThrLeuProCysArglleLysGln 
IlelleAsnMetTrpGlnGluValGlyLysAlaMetTyrAlaProProIleSer 
GlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGly 
GlyAsnSerAsnAsnGluSerGluIleArgArgGliiAlaSerArgGluLeuGlu 
PheLeuLysThrLysGlyProArgAspThrProIlePhelleGly. 

9. HTLV-III envelope protein fragment denoted 590 

having the following amino-acid sequence, or mutants 

thereof: MetLeuArgProValGluThr 

ProThrArgGluIleLysLysLeuAspGlyLeuTrpAlaPheSerLeuAspArg 

GluArgValAlaAspLeuAsnGlnSerValGluIleAsnCysThrArgProAsn 

AsnAsnThrArgLysSerlleArglleGlnArgGlyProGlyArgAlaPheVal 

ThrlleGlyLysIleGlyAsnMetArgGlnAlaHisCysAsnlleSerArgAla 

LysTrpAsnAsnThrLeuLysGlnlleAspSerLysLeuArgGluGlnPheGly 

AsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGluIleVal 

ThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeu 

PheAsnSerThrTrpPheAsnSerThrTrpSerThrLysGlySerAsnAsnThr 

GluGlySerAspThrlleThrLeuProCysArglleLysGlnllelleAsnMet 

TrpGlnGluValGlyLysAlaMetTyrAlaProProIleSerGlyGlnlleArg 

CysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnSerAsn 

AsnGluSerGluIlePheArgProGlyGlyGlyAspMetArgAspAsnTrpArg 

SerGluLeuTyrLysTyrLysValValLysIleGluProLeuGlyValAlaPro 
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18 ThrLysAlaLyBArgArgValValGlnArgGluLyBArgAlaValGlylleGly 

19 AlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMetGlyAlaAlaSer 

20 MetThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGln 

21 AsnAsnLeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrVal 

22 TrpGlylleLysGlnLeuGlnAlaArglleLeuAlaValGluArgTyrLeuLys ' 

23 ABpGlnGlnLeuLeuGlylleTrpGlyCysSerGlyLyeLeuIleCysThrThr 

24 Al aValPr oTrpABnAlaSe rTrpSe r AsnLysSe rLeuGluGlnl 1 eTrpAsn [ 

25 AsnMetThrTrpMetGluTrpAspArgGluIleABnAsnTyrThrSerPhePro 

26 IleHisArgSerValMetLeuTy rThrThrProAsnThrTrpValABpAspIle 

27 ThrValValThrHisyalAlaGlnABpCyBAsnHiBAlaSerVaaAspTrpGln 

28 ValValAlaAsnGlyAspValSerValGluLeuArgAapAlaAspGlnGlnVal 

29 ValAlaThrGlyGlnGlyThrSerGlyThrLeuGlnValValAshProHisLeu 

30 TrpGlnProGlyGluGlyTyrLeuTyrGluLeuCyBValThrAlaliyBSerGln 

31 ThrGluCysAspIleTyrProLeuArgValGlylleArgSerValAlaValLys 
.32 G.lyGluGlnPheLeuIleAsnHisLysProPheTyrPheThrGlyPheGlyArg 
33 HisGluAspAlaAspLeuArgGlyLysGlyPheAspAsnValLeuMetValHis 
3 A AspHisAlaLeuMetAspTrpIleGlyAlaAsnSerTyrArgThrSerHiBTyr 

35 ProTyrAlaGluGluMetLeuAspTrpAlaAspGluHlsGlylleValVallle 

36 AspGluThrAlaAlaValGlyPheAsnLeuSerLeuGlylleGlyPheGluAla 

37 GlyAsnLysProLysGluLeuTyrSerGluGluAlaValAsnGlyGluThrGln 

38 GlnAlaHisLeuGlnAlalleLysGluLeulleAlaArgABpLyBABnHisPro 

39 SerValValMetTrpSerlleAlaAsnGluProAspThrArgProGlnGlyAla 

40 ArgGluTyrPheAlaProLeuAlaGluAlaThrArgLyBLeuAspProThrArg 

41 ProlleThrCyBValAsnValMetPheCysAspAlaHiBThrABpThrlleSer 

42 AspLeuPheAspValLeuCysLeuAsnArgTyrTyrGlyTrpTyrValGlnSer 

43 GlyAspLeuGluThrAlaGluLysValLeuGluLyBGluLeuLeuAlaTrpGln 

44 GluLysLeuHiBGlnProIlellelleThrGluTyrGlyValABpThrLeuAla . 
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4 5 GlyLeuHisSerMetTyrThrAspMetTrpSerGluGluTyrGlnCysAlaTrp 

46 LeuAspMetTyrHisArgValPheAspArgValSerAlaValValGlyGluGIn 

47 ValTrpAsnPheAlaAspPheAlaThrSerGlnGlylleLeuArgValGlyGly 

48 AsnLysLysGlyllePheThrArgAspArgLysProLysSerAlaAlaPheLeu 
A 9 LeuGlnLysArgTrpThrGlyMetAsnPheGlyGluLysProGlnGlnGlyGly 
50 LysGln, 

1 10. HTLV-III envelope protein fragment denoted 

2 KH1 having the following amino- acid sequence, or 

3 mutants thereof: 

4 HetLeuArg 

5 ProValGluThrProThrArgGluIleLysLysLeuAspGlyLeuTrpAlaPhe 

6 SerLeuAspArgGluArgGluPheProValTrpLysGluAlaThrThrThrLeu 

7 pheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAla 

8 ThrHisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsn 

9 valThrGluAsnPheAsnMetTrpLysAsnAspMetValGluGlnMetHisGlu 

10 AspIlelleSerLeuTrpAspGlnSerLeuLysProCysValLysLeuThrPro 

11 LeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThrAsnSer 

12 serSerGlyArgMetlleMetGluLysGlyGluIleLysAsnCysSerPheAsn 

13 HeSerThrSerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLys 

14 LeuAspIlelleProIleAspAsnAspThrThrSerTyrThrLeuThrSerCys 

15 AsnThrSerVallleThrGlnAlaCysProLysValSerPheGluProllePro 

• 16 HeHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsnAsnLysThr - 

17 pheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGly 

18 IleArgProValValSerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGlu 

19 GluValVallleArgSerAlaAsnPheThrAspAsnAlaLysThrllelleVal 

20 GlnLeuAsnGlnSerValGluIleAsnCysThrArgProAsnAsnAsnThrArg 

21 LysSerlleArglleGlnArgGlyProGlyArgAlaPheValThrlleGlyLys 
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22 ileGlyAsnMetArgGlnAlaBisCyBAsnlleSerArgAlaLysTrpAsnAsn 

23 ThrLeuLyBGlnlleAspSerLysLeuArgGluGlnPheGlyAenAenLysThr 

24 IlellePheLysGlnSerSerGlyGlyAspProGluXleValThrHiBSerPhe 

25 AsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheABnSerThr 

26 TrpPheAanSerThrTrpSerThrLysGlySerABnAsnThrGluGlySerAsp 

27 ThrlleThrLeuProCysArglleLysGlnllelleABnMetTrpGlnGluVal 

28 GlyLysAlaHetTyrAlaProProIleSerGlyGlnlleArgCysSerSerAsn 

29 HeThrGlyLeuLeuLeuThrArgABpGlyGlyAanSerAsnAsnGluSerGlu 

30 IlePheArgProGlyGlyGlyAspHetArgAspAsnTrpArgSerGluLeuTyr 

31 LysTyrLysValValLysIleGluProLeuGlyValAlaProThrLyaAlaLys 

32 ArgArgValValGlnArgGluLysArgAlaValGlytleGlyAlaLeuPheLeu 

33 GlyPheLeuGlyAlaAlaGlySerThrMetGlyAlaAlaSerMetThrLeuThr 
3A ValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsnLeuLeu 

ArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLys 

36 GlnLeuGlnAlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeu 

37 LeuGlylleTrpGlyCysSerGlyLysLeuIleCysThrThrAlaValProTrp 
3g AsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrpAsnAsnMetThrTrp 
3g MetGluTrpAspArgGluIleAsnAsnTyrThrSerPheProGlyAlaArglle 
4Q LeuGluAspGluArgAlaSer . 

1 11. A plasmid selected from the following: 

2 plasmid pREVl , plasmid pREVITT , plasmid pREVlTT/chl , 

3 plasmid pREV2.2, plasmid pRIO , plasmid pPBl, plasmid 

4 p590, and plasmid pKHl, 
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and preferably any of the last five of these eight 
plasmids. 

12.. DNA having the nucleotide sequence defined in any of 
claims 1 to 4 , or an equivalent nucleotide sequence 
containing bases whose translated region codes for 
HTLV-III envelope protein fragment denoted RIO, PBl, 590 
or KH1 . 

13. An immunochemical assay for detecting or quantifying 
antibody against HTLV-III in a fluid, which comprises 
employing an HTLV-III protein selected from RIO, PBl, 590 
and KH1 . 

14. An immunoadsorbent suitable for use in a solid phase 
immunochemical assay for antibody against HTLV-III , which 
comprises a solid phase to which is affixed an HTLV-III 
protein selected from R10 r PBl, 590 and KHl . 

15. An immunoadsorbent according to claim 14, wherein 
the solid phase is a glass or plastic bead, a well of a 
microliter plate or a test tube. 

16. An immunoadsorbent according to claim 14 or claim 
15, which additionally comprises a post-coat of animal 
protein . 

17. A kit suitable for use in detecting antibody against 
HTLV-III in a biological fluid, which comprises: 

(a) an immunoadsorbent according to any of claims 
14 to 16; 

(b) labelled HTLV-III antibody; and 

(c) means for detecting the label associated with 
the immunoadsorbent . 

18. A kit according to claim 17, wherein the 
anti-HTLV-III antibody is labelled with anti- (human IgG) 
antibody. 

19. A method of detecting antibody against HTLV-III in a 
biological fluid, which comprises the steps of: 

(a) incubating an immunoadsorbent according to any 
of claims 14 to 16 with a sample of the biological fluid, 



0255190 



-77- 

under conditions which allow anti-HTLV-III antibody in 
the sample to bind to the immunoadsorbent; 

(b) separating the immunoadsorbent from the sample; 

and 

5 (c) determining if antibody has bound, to the 

immunoadsorbent as an indication of anti-HTLV-III in the 
sample. 

20. A method according to claim 19 , wherein at ep (c) 
comprises incubating the immunoadsorbent witfi (1) a 

10 labelled antibody against antigen of the species from 
which the biological fluid is derived, labelled 
HTLV-III protein selected from RIO, PB1, 590 and KH1, or 
(iii) labelled protein A; separating the immunoadsorbent 
from the labelled antibody, HTLV-III protein or protein 

15 A? and detecting the label associated with the 
immunoadsorbent. 

21. A method of detecting antibody against HTLV-III in a 
human serum or plasma sample, which comprises the steps 
of: 

20 (a) incubating a bead of an immunoadsorbent 

according to any of claims 14 to 16 with the serum or 
plasma sample under conditions which allow anti-HTLV-III 
antibody in the sample to bind to the iirkunoadsorbent; 

(b) separating the immunoadsorbent and the sample; 

25 (c) incubating the immunoadsorbent with a labelled 

anti- (human IgG) antibody under conditions which allow 
the anti- (human IgG) antibody to bind human anti-HTLV-III 
antibody bound to the immunoadsorbent; 

(d) separating the immunoadsorbent from the unbound 
30 anti- (human IgG) antibody; and 

(e) evaluating the label associated with the 
immunoadsorbent as an indication of the presence of 
antibody against HTLV-III in the sample. 

22. A method according to claim 21, wherein the 

35 anti- (human IgG) antibody is an animal antibody and the 
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serum or plasma sample is diluted with normal serum of an 
animal of the same species. 

23. A method according to claim 22, wherein the 
anti-(human IgG) antibody is a goat antibody and the 

5 serum or plasma sample is diluted with normal goat serum. 

24. A method according to any of claims 21 to 23, 
wherein the anti- (human IgG) antibody is labelled with a 
radioisotope, an enzyme or a fluorescent compound. 

25. A vaccine composition which comprises an HTLV-III 
10 protein having the antigenic properties of RIO, PB1, 590 

or KH1, in a pharmacologically-acceptable vehicle. 

26. A recombinant HTLV-III envelope protein fragment 
selected from RIO, PB1, 590 and KH1, for therapeutic use. 

15 
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